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I. REAL PARTY IN INTEREST 

The real party in interest is Genentech, Inc., South San Francisco, California, by an 
assignment of the parent application, U.S. Patent Application Serial No. 09/665,350 recorded 
July 9, 2001, at Reel 01 1964 and Frame 0181 . The present application is a continuation of U.S. 
Serial No. 09/665,350. 

II. RELATED APPEALS AND INTERFERENCES 

The claims pending in the current application are directed to a polypeptide referred to 
herein as "PR0232". There exists one related patent application, U.S. Serial No. 09/906,777, 
filed July 16, 2001 (containing claims directed to nucleic acids encoding PR0232 polypeptides). 

III. STATUS OF CLAIMS 

Claims 44-46 and 49-51 are in this application. 
Claims 1-44 and 47-48 have been canceled. 

Claims 44-46 and 49-51 stand rejected and Appellants appeal the rejection of these 
claims. ' 

A copy of the rejected claims in the present Appeal is provided as Appendix A. 

IV. STATUS OF AMENDMENTS 

There were no amendments submitted after the final rejection mailed October 21, 2005. 
All previous amendments have been entered. 

V. SUMMARY OF CLAIMED SUBJECT MATTER 

The invention claimed in the present application is related to an isolated polypeptide 
comprising the amino acid sequence of the polypeptide of SEQ ID NO: 18, referred to in the 
present application as "PR0232." The PR0232 gene was shown for the first time in the present 
application to be significantly amplified in human lung cancers as compared to normal, 
non-cancerous human tissue controls (Example 92). This feature is specifically recited in claim 
124, and carried by all claims dependent from claim 44. In addition, the invention also claims 
the amino acid sequence of the polypeptide of SEQ ID NO: 18, lacking its associated signal- 
peptide; or the amino acid sequence of the polypeptide encoded by the full-length coding 



sequence of the cDNA deposited under ATCC accession number 209250 (Claim 44-46 and 49). 
The invention is further directed to a chimeric polypeptide comprising one of the above 
polypeptides fused to a heterologous polypeptide (Claim 50), and to a chimeric polypeptide 
wherein the heterologous polypeptide is an epitope tag or an Fc region of an immunoglobulin 
(Claim 51). The preparation of chimeric PRO polypeptides (claims 50 and 51), including those 
wherein the heterologous polypeptide is an epitope tag or an Fc region of an immunoglobulin, is 
set forth in the specification at page 74, lines 23 to page 75, line 5. Examples 53-56, pages 192- 
199, describe the expression of PRO polypeptides in various host cells, including E. coli, 
mammalian cells, yeast and Baculovirus-infected insect cells. 

The amino acid sequence of the "PR0232" polypeptide and the nucleic acid sequence 
encoding this polypeptide (referred to in the present application as "DNA34435-1 140") are 
shown in the present specification as SEQ ID NOs: 18 and 17, respectively, and in Figures 9 and 
8, described on page 59, lines 4-7. The full-length PR0232 polypeptide having the amino acid 
sequence of SEQ ID NO: 18 is described in the specification at, for example, on page 4, pages 3- 
4 and page 100, page 131, line 9 to 16 and the isolation of cDNA clones encoding PR0232 of 
SEQ ID NO: 18 is described in Example 4, page 149-150 of the specification. The specification 
discloses that the PR0232 polypeptide possess significant sequence homology to cell surface 
stem cell antigen (35% sequence identity with a stem cell surface antigen from Gallus gallus) 
and may play a role in cell proliferation and/or differentiation, (see for example, page 4 and 
Example 4, line 14-15). 

Finally, Example 92, in the specification at page 222, line 26, to page 235, line 3, sets 
forth a 'Gene Amplification assay' which shows that the PR0232 gene is amplified in the 
genome of certain human lung cancers (see Table 9, pages 230-234). The profiles of various 
primary lung and colon tumors used for screening the PRO polypeptide compounds of the 
invention in the gene amplification assay are summarized on Table 8, page 227 of the 
specification. 
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VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

1 . Whether instant Claims 44-46 and 49-5 1 satisfy the utility/enablement 
requirement under 35 U.S.C. §§101/112, first paragraph. 

2. Whether Claims 44-46 and 49-51 are entitled to the priority date of U.S. 
Provisional Application 60/059121, filed September 17, 1997. 

3. Whether Claims 44-46 and 49-5 1 are anticipated under 35 U.S.C. § 102(b) by 
Rosenthal et al., DE19818619-A1 (October 1999). 

VII. ARGUMENTS 

Summary of the Arguments 

Issue 1 : Utility 

The sole basis for the Examiner's rejection of the instant Claims 44-46 and 49-51 is 
allegedly because the subject matter does not satisfy the conditions under 35 U.S.C. §§ 101/1 12, 
first paragraph. 

Appellants have previously submitted that patentable utility for the PR0232 polypeptides 
is based upon the gene amplification data for the gene encoding the PR0232 polypeptide. The 
specification discloses that the gene encoding PR0232 showed significant amplification, ranging 
from 2.056-fold to 5.28-fold, in five lung tumors or 2.00-fold to 5.32-fold in seven colon tumors . 
Thus, Appellants submit that the PR0232 polypeptide is useful as a marker for the diagnosis of 
lung or colon cancer , and for monitoring cancer development and/or for measuring the efficacy 
of cancer therapy. 

The Examiner says that "(w)hile one can find prior art that supports a "significant 
probability" that mRNA and protein levels will correlate, there is influential art in the record that 
requires the Examiner to maintain that, as a whole, the prior art does not provide a reasonable 
expectation that expression of the nucleic acid of SEQ expression" (Page 3 , lines 1 1 - of the 
Final Office Action mailed October 21, 2005), and bases this rejection on the teachings of 
Pennica et al, Haynes et al, Konopka et al, Hu et al, Lian et al and Fessler et al 

Appellants respectfully disagree and submit that the teachings of Pennica et al, Haynes 
et al, Konopka et al, Hu et al, Lian et al and Fessler et al do not conclusively establish a 
prima facie case for lack of utility because the references are, either not contrary to the 
Appellants' arguments, or, actually lend support to the Appellants' position, or are not applicable 
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to the present application due to limitations in the study, as explained in detail below. On the 
other hand, Appellants add that while the literature indicates that some references demonstrate a 
positive correlation between mRNA expression and protein levels, while some show no 
correlation, there are more cases in literature that show a positive correlation than not . 

Appellants have submitted ample evidence to show that in general, if a gene is amplified 
in cancer, it is more likely than not that the encoded protein will be expressed at an elevated 
level. First, the articles by Orntoft et al. 9 Hyman et al. 9 and Pollack et al (made of record in 
Appellants' Response filed August 9, 2004) collectively teach that in general gene amplification 
increases mRNA expression . Second, the Declaration of Dr. Paul Polakis (made of record in 
Appellants' Response filed August 9, 2004), principal investigator of the Tumor Antigen Project 
of Genentech, Inc., the assignee of the present application, shows that, in general there is a 
correlation between mRNA levels and polypeptide levels . Appellants further note that the sale 
of gene expression chips to measure mRNA levels is a highly successful business, with a 
company such as Affymetrix recording 168.3 million dollars in sales of their GeneChip arrays in 
2004. Clearly, the research community believes that the information obtained from these chips is 
useful (i.e., that it is more likely than not informative of the protein level). 

Taken together, although there are some examples in the scientific art that do not fit 
within the central dogma of molecular biology that there is a correlation between DNA, mRNA, 
and polypeptide levels, Appellants maintain that, in general, there is a positive correlation 
between mRNA and protein levels, as exemplified by the teachings within a larger proportion if 
the references available in the art. Therefore, one of skill in the art would reasonably expect in 
this instance, based on the amplification data for the PR0232 gene, that the PR0232 polypeptide 
is concomitantly overexpressed, and that it has utility in the diagnosis of lung or colon cancer. 

Appellants further submit that even if, in this instance, there were no correlation between 
gene amplification and increased mRNA/protein expression, (which Appellants expressly do not 
concede), a polypeptide encoded by a gene that is amplified in cancer would still have a specific, 
substantial, and credible utility. Appellants submit that, as evidenced by the Ashkenazi 
Declaration and the teachings of Hanna and Mornin (both made of record in Appellants' 
Response filed August 9, 2004), simultaneous testing of gene amplification and gene product 
over-expression enables more accurate tumor classification , even if the gene-product, the protein, 
is not over-expressed. This leads to better determination of a suitable therapy for the tumor, as 
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demonstrated by a real-world example of the breast cancer marker HER-2/neu. Accordingly, 
Appellants submit that when the proper legal standard is applied, one should reach the 
conclusion that the present application discloses at least one patentable utility for the claimed 
PR0232 polypeptides. 

Therefore, Appellants submit that the instant Claims 44-46 and 49-51 satisfy 35 U.S.C. 
§§ 101/1 12, first paragraph and hence these rejections should be withdrawn. 

Issue 2: Priority 

The instant application has not been granted the earlier priority date of U.S. Provisional 
Application 60/059121, filed September 17, 1997 on the grounds that the 60/059121 application 
fails to provide a utility and lacks an enabling disclosure for the claimed invention under 35 
U.S.C. §§ 101/1 12, first paragraph. (Page 14 of the Final Office Action mailed October 21, 
2005). Appellants submit that, for the same reasons discussed above under Issue 1, U.S. 
Provisional Application 60/059121 also satisfies the utility requirements. Therefore, Appellants 
should be entitled to the priority date of September 17, 1997. 

Issue 3 : Anticipation bv Rosenthal et al. 

As discussed above under Issue 2, the present application should be entitled to the earlier 
filing date of September 17, 1997 and therefore, Rosenthal et al, DE19818619-A1, dated 
October 1999, is not prior art. Thus the instant claims are not anticipated by Rosenthal et al. 

These arguments are all discussed in further detail below under the appropriate headings. 
Response to Rejections 

ISSUE 1: The Instant Claims 44-46 and 49-51 Satisfy the Utility Requirement under 35 
U.S.C. $ 101 / S 112, First Paragraph based on the results of the gene amplification assay 

The sole basis for the Examiner's rejection of Claims 44-46 and 49-51 under this section 
is that the data presented in the instant Application, allegedly, does not satisfy the requirements 
of 35 U.S.C. §§ 101/112, first paragraph. Appellants strongly disagree for the reasons discussed 
below. 
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A. The Legal Standard For Utility Under 35 U.S.C. § 101 

According to 35 ILS.C. § 101: 

Whoever invents or discovers any new and useful process, machine, manufacture, or 
composition of matter, or any new and useful improvement thereof, may obtain a patent 
therefor, subject to the conditions and requirements of this title. (Emphasis added.) 
In interpreting the utility requirement, in Brenner v. Manson the Supreme Court held 

that the quid pro quo contemplated by the U.S. Constitution between the public interest and the 

interest of the inventors required that a patent applicant disclose a "substantial utility" for his or 

2 

her invention, i.e. a utility "where specific benefit exists in currently available form." The Court 
concluded that "a patent is not a hunting license. It is not a reward for the search, but 
compensation for its successful conclusion. A patent system must be related to the world of 

3 

commerce rather than the realm of philosophy." 

4 

Later, in Nelson v. Bowler the C.C.P.A. acknowledged that tests evidencing 
pharmacological activity of a compound may establish practical utility, even though they may 
not establish a specific therapeutic use. The court held that "since it is crucial to provide 
researchers with an incentive to disclose pharmaceutical activities in as many compounds as 
possible, we conclude adequate proof of any such activity constitutes a showing of practical 

utility." 5 

In Cross v. Iizuka the C.A.F.C. reaffirmed Nelson, and added that in vitro results might 
be sufficient to support practical utility, explaining that "in vitro .testing, in general, is relatively 
less complex, less time consuming, and less expensive than in vivo testing. Moreover, in vitro 
results with the particular pharmacological activity are generally predictive of in vivo test results, 

1 Brenner v. Manson, 383 U.S. 519, 148 U.S.P.Q. (BNA) 689 (1966). 

2 Id. at 534, 148 U.S.P.Q. (BNA) at 695. 

3 Id. at 536, 148 U.S.P.Q. (BNA) at 696. 

4 Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. (BNA) 881 (C.C.P.A. 1980). 

5 Id. at 856, 206 U.S.P.Q. (BNA) at 883. 

6 Cross v. Iizuka, 753 F.2d 1047, 224 U.S.P.Q. (BNA) 739 (Fed. Cir. 1985). 



i.e. there is a reasonable correlation there between." The court perceived "No insurmountable 
difficulty" in finding that, under appropriate circumstances, "in vitro testing, may establish a 

8 

practical utility." 

The case law has also clearly established that Appellants 5 statements of utility are usually 

9 ... 

sufficient, unless such statement of utility is unbelievable on its face. The PTO has the initial 

10 

burden to prove that Appellants' claims of usefulness are not believable on their face. In 
general, an Applicant's assertion of utility creates a presumption of utility that will be sufficient 
to satisfy the utility requirement of 35 U.S.C. §101, "unless there is a reason for one skilled in 

11 12 

the art to question the objective truth of the statement of utility or its scope. ' 

13 

Compliance with 35 U.S.C. §101 is a question of fact. The evidentiary standard to be 
used throughout ex parte examination in setting forth a rejection is a preponderance of the 

14 

totality of the evidence under consideration. Thus, to overcome the presumption of truth that 
an assertion of utility by the applicant enjoys, the Examiner must establish that it is more likely 
than not that one of ordinary skill in the art would doubt the truth of the statement of utility. 
Only after the Examiner made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the applicant. The issue will then be decided on the totality of evidence. 

7 Id. at 1050, 224 U.S.P.Q. (BNA) at 747. 

8 u. 

9 In re Gazave, 379 F.2d 973, 154 U.S.P.Q. (BNA) 92 (C.C.P.A. 1967). 

10 Ibid. 

U In reLanger, 503 F.2d 1380,1391, 183 U.S.P.Q. (BNA) 288, 297 (C.C.P.A. 1974). 

12 See also In re Jolles, 628 F.2d 1322, 206 USPQ 885 (C.C.P.A. 1980); In re Irons, 340 
F.2d 974, 144 USPQ 351 (1965); In re Sichert, 566 F.2dll54, 1159, 196 USPQ 209,212-13 
(C.C.P.A. 1977). 

13 Raytheon v. Roper, 724 F.2d 951, 956, 220 U.S.P.Q. (BNA) 592, 596 (Fed. Cir. 1983) 
cert, denied, 469 US 835 (1984). 

14 In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d (BNA) 1443, 1444 (Fed. Cir. 

1992). 
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The well established case law is clearly reflected in the Utility Examination Guidelines 

("Utility Guidelines") 15 , which acknowledge that an invention complies with the utility 
requirement of 35 U.S.C. §101, if it has at least one asserted "specific, substantial, and credible 
utility" or a "well-established utility." Under the Utility Guidelines, a utility is "specific" when 
it is particular to the subject matter claimed. For example, it is generally not enough to state that 
a nucleic acid is useful as a diagnostic without also identifying the conditions that are to be 
diagnosed. 

In explaining the "substantial utility" standard, M.P.E.P. §2107.01 cautions, however, 
that Office personnel must be careful not to interpret the phrase "immediate benefit to the 
public" or similar formulations used in certain court decisions to mean that products or services 
based on the claimed invention must be "currently available" to the public in order to satisfy the 
utility requirement. "Rather, any reasonable use that an applicant has identified for the invention 
that can be viewed as providing a public benefit should be accepted as sufficient, at least with 
regard to defining a 'substantial' utility." 16 Indeed, the Guidelines for Examination of 
Applications for Compliance With the Utility Requirement/ 7 gives the following instruction to 
patent examiners: "If the applicant has asserted that the claimed invention is useful for any 
particular practical purpose . . . and the assertion would be considered credible by a person of 
ordinary skill in the art, do not impose a rejection based on lack of utility." 

B. Proper Application of the Legal Standard 

Appellants respectfully submit that the data presented in Example 92 starting on page 222 
of the priority application and the cumulative evidence of record, which underlies the current 
dispute, indeed support a "specific, substantial and credible" asserted utility for the presently 
claimed invention. 

Patentable utility for the PR0232 polypeptides is based upon the gene amplification data 
for the gene encoding the PR0232 polypeptide. Example 92 describes the results obtained using 

15 66 Fed. Reg. 1092(2001). 

16 M.P.E.P. §2107.01. 

17 M.P.E.P. §2107 II (B)(1). 
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a very well-known and routinely employed polymerase chain reaction (PCR)-based assay, the 
TaqMan™ PCR assay, also referred to herein as the gene amplification assay. This assay allows 
one to quantitatively measure the level of gene amplification in a given sample, say, a tumor 
extract, or a cell line. It was well known in the art at the time the invention was made that gene 
amplification is an essential mechanism for oncogene activation. Appellants isolated genomic 
DNA from a variety of primary cancers and cancer cell lines that are listed in Table 9 (pages 222 
onwards of the specification), including primary lung and colon cancers of the type and stage 
indicated in Table 8 (page 227). The tumor samples were tested in triplicates with Taqman™ 
primers and with internal controls, beta-actin and GAJDPH in order to quantitatively compare 
DNA levels between samples (page 229). As a negative control, DNA was isolated from the 
cells often normal healthy individuals, which was pooled and used as a control (page 222, lines 
28-29). The results of TaqMan™ PCR are reported in ACt units, as explained in the passage on 
page 222, lines 37-39. One unit corresponds to one PCR cycle or approximately a 2-fold 
amplification, relative to control, two units correspond to 4-fold, 3 units to 8-fold amplification 
and so on. Using this PCR-based assay, Appellants showed that the gene encoding for PR0232 
was amplified, that is, it showed approximately 1.04-2.40 ACt units for five lung tumors and 
1.00-2.41 ACt units for seven colon tumors, which corresponds to 2 1 04 ^2 2 40 - fold amplification 
in lung or to 2 1 00 -2 2 41 - fold amplification in colon tumors; that is 2.056-fold to 5.28-fold, in five 
lung tumors or 2.00-fold to 5.32-fold in seven colon tumors , which would be considered 
significant and credible by one skilled in the art. Therefore, the PR0232 gene and the PR0232 
polypeptide are important diagnostic markers to identify such malignant lung or colon cancers. 

The Examiner says that "(w)hile one can find prior art that supports a 'significant 
probability 5 that mRNA and protein levels will correlate, there is influential art in the record that 
requires the Examiner to maintain that, as a whole, the prior art does not provide a reasonable 
expectation that expression of the nucleic acid of SEQ expression" (Page 3, lines 1 1 - of the 
Final Office Action mailed October 21, 2005), and bases this rejection on the teachings of 
Pennica et al, Haynes et al, Konopka et al, Hu et al, Lian et al and Fessler et al 

Appellants strongly disagree. The evidentiary standard to be used throughout ex parte 
examination of a patent application is a preponderance of the totality of the evidence under 
consideration. Thus, to overcome the presumption of truth that an assertion of utility by the 
applicant enjoys, the Examiner must establish that it is more likely than not that one of ordinary 
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skill in the art would doubt the truth of the statement of utility. Only after the Examiner has 
made a proper prima facie showing of lack of utility, does the burden of rebuttal shift to the 
applicant. Accordingly, it is not a legal requirement to establish a necessary correlation between 
an increase in the copy number of the DNA and protein expression levels nor is it imperative to 
find evidence that DNA amplification is " necessarily " or "always" associated with 
overexpression of the gene product. Appellants respectfully submit that when the proper 
evidentiary standard is applied, a correlation must be acknowledged. 

Appellants further submit that the teachings of Pennica et al y Haynes et al t Konopka et 
al, Hu et al, Lian et al and Fessler et al do not conclusively establish a prima facie case for 
lack of utility because the references are, either not contrary to the Appellants' arguments, or, 
actually lend support to the Appellants 5 position, as explained in the discussions below. 

Pennica et al* Konopka et al. and Haynes et al 

The teachings of Pennica et al are specific to WISP genes, a specific class of closely 
related molecules. Pennica et al showed that there was good correlation between DNA and 
mRNA expression levels for the WISP-1 gene but not for WISP-2 and WISPS genes. But, the 
fact that in the case of closely related molecules, there seemed to be no correlation between gene 
amplification and the level of mRNA/protein expression does not establish that it is more likely 
than not, in general, that such correlation does not exist. As discussed above, the standard is not 
absolute certainty . Pennica et al has no teaching whatsoever about the correlation of gene 
amplification and protein expression for genes in general . Indeed, the working hypothesis 
among those skilled in the art is that, if a gene is amplified in cancer, the encoded protein is 
likely to be expressed at an elevated level. In fact, as noted even in Pennica et al, "[a]n analysis 
of WISP-l gene amplification and expression in human colon tumors showed a correlation 
between DNA amplification and over-expression ..." (Pennica et al, page 14722, left column, 
first full paragraph, emphasis added). Accordingly, Appellants respectfully submit that Pennica 
et al teaches nothing conclusive regarding the absence of correlation between gene amplification 
and over-expression of mRNA or polypeptides in most genes, in general. Pennica et al has no 
teaching whatsoever about the correlation of gene amplification and protein expression for genes 
in general . 
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Similarly, in Konopka et al, Appellants submit that the Examiner has generalized a very 
specific result disclosed by Konopka et al to cover all genes. Konopka et al actually state that 
"[p]rotein expression is not related to amplification of the abl gene but to variation in the level of 
bcr-abl mRNA produced from a single Ph 1 template." (See Konopka et al, Abstract, emphasis 
added). The paper does not teach anything whatsoever about the correlation of protein 
expression and gene amplification in general and provides no basis for the generalization that 
apparently underlies the present rejection. The statement of Konopka et al that "[p]rotein 
expression is not related to amplification of the abl gene ..." is not sufficient to establish a 
prima facie case of lack of utility. Therefore, the combined teachings of Pennica et al and 
Konopka et al are not directed towards genes in general but to a single gene or genes within a 
single family and thus, their teachings cannot support a general conclusion regarding correlation 
between gene amplification and mRNA or protein levels. 

Actually, the cited reference Haynes et al, showed that " there was a general trend, 
although no strong correlation between protein [expression] and transcript levels." (see Figure 1 
and page 1863, paragraph 2.1, lastTine). Therefore, when the proper legal standard is used, 
Haynes clearly supports the Appellants' position. This is all that's needed to meet the "more 
likely than not" evidentiary standard. Again, accurate prediction is not the standard . Therefore, 
a prima facie case of lack of utility has not been met based on the cited references Pennica et al, 
Konopka et al and Haynes et al 

Hu et al 

Appellants respectfully submit that, contrary to the Examiner's assertion, the cited Hu et 
al reference does not conclusively establish a prima facie case for lack of utility for the PRO290 
molecule. The Hu et al reference is entitled "Analysis of Genomic and Proteomic Data using 
Advanced Literature Mining" (emphasis added). Therefore, as the title itself suggests, the 
conclusions in this reference are based upon statistical analysis of information obtained from 
published literature, and not from experimental data. Hu et al performed statistical analysis to 
provide evidence for a relationship between mRNA expression and biological function of a given 
molecule (as in disease). The conclusions of Hu et al however, only apply to a specific type of 
breast tumor (estrogen receptor (ER)-positive breast tumor) and cannot be generalized to breast 
cancer genes in general, let alone to cancer genes in general. Interestingly, the observed 

12 



correlation was only found among ER-positive (breast) tumors not ER-negative tumors." (See 
page 412, left column). 

Moreover, the analytical methods utilized by Hu et al have certain statistical drawbacks, 
as the authors themselves admit. For instance, according to Hu et al, "different statistical 
methods" were applied to "estimate the strength of gene-disease relationships and evaluated the 
results." (See page 406, left column, emphasis added). Using these different statistical methods, 
Hu et al "[a]ssessed the relative strengths of gene-disease relationships based on the frequency 
of both co-citation and single citation." (See page 41 1, left column). As is well known in the 
art, different statistical methods allow different variables to be manipulated to affect the resulting 
outcome. In this regard, the authors disclose that, "Initial attempts to search the literature " using 
the list of genes, gene names, gene symbols, and frequently used synonyms generated by the 
authors "revealed several sources of false positives and false negatives." (See page 406, right 
column). The authors add that the false positives caused by "duplicative and unrelated meanings 
for the term" were "difficult to manage." Therefore, in order to minimize such false positives, 
Hu et al disclose that these terms "had to be eliminated entirely, thereby reducing the false 
positive rate but unavoidably under-representing some genes." Id. (Emphasis added). Hence, 
Hu et al had to manipulate certain aspects of the input data, in order to generate, in their opinion, 
meaningful results. Further, because the frequency of citation for a given molecule and its 
relationship to disease only reflects the current research interest of a molecule, and not the true 
biological function of the molecule, as the authors themselves acknowledge, the "[relationship 
established by frequency of co-citation do not necessarily represent a true biological link." (See 
page 411, right column). Therefore, based on these findings, the authors add, "[t]his may reflect 
a bias in the literature to study the more prevalent type of tumor in the population. Furthermore, 
this emphasizes that caution must be taken when interpreting experiments that may contain 
subpopulations that behave very differently." Id. (Emphasis added). In other words, some 
molecules may have been underrepresented merely because they were less frequently cited or 
studied in literature compared to other more well-cited or studied genes. Therefore, Hu et al 's 
conclusions are not based on genes/mRNA in general 

Therefore, Appellants submit that, based on the nature of the statistical analysis 
performed herein, and in particular, based on Hu's analysis of one class of genes, namely, the 
estrogen receptor (ER)-positive breast tumor genes, the conclusions drawn by the Examiner, 
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namely that, "genes displaying a 5-fold change or less (mRNA expression) in tumors compared 
to normal showed no evidence of a correlation between altered gene expression and a known role 
in the disease (in general)" is not reliably supported. 

Lian et al. and Fessler et al. 

The Examiner further cites new references by Lian et al and Fessler et al in support of 
her interpretation that "protein levels cannot be accurately predicted from the level of the 
corresponding mRNA transcript." (Page 4 of the instant Final Office Action mailed October 21, 
2005). 

Appellants respectfully submit that Lian et al only teach that protein expression may not 
correlate mRNA level in differentiating myeloid cells and does not teach anything of such a lack 
of correlation for genes in general . In fact, the authors themselves admit that there were a 
number of problems with their data . For instance, at page 520 of this article, the authors 
explicitly express their concerns regarding the methods they utilized and the interpretation of 
their data stating that " |"t]hese data must be considered with several caveats: membrane and other 
hydrophobic proteins and very basic proteins are not well displayed by the standard 2DE 
approach, and proteins presented at low level will be missed. In addition, to simplify MS 
anlysis, we used a Coomassie dye stain rather than silver to visualize proteins, and this decreased 
the sensitivity of detection of minor proteins. " (Emphasis added). Appellants submit, as is well- 
known in the art, the Coomassie dye staining method is a very insensitive method of measuring 
protein. Therefore, the conclusions based on such measurements would hardly be considered 
accurate by the skilled artisan, or at least, would not be extrapolated to generally reflect the gene: 
mRNA/ protein relationships for proteins in general. Therefore, even if the teachings of Lian et 
al reflects a lack of correlation between the genes and mRNA/ proteins in differentiating 
myeloid cells (which Appellants submit is not a representative sample of genes in general since 
only pertain genes are expressed during differentiation), their conclusions are based on a widely 
accepted, insensitive method for protein staining, namely, the Coomassie dye staining, which 
cannot be applied to genes in general. 

Similarly, in Fessler et al, Appellants submit that the PTO has overlooked a number of 
limitations in their, which the authors themselves acknowledge. For instance, Fessler et al only 
examined lipopoysaccharide-activated neutrophilins, so, as with Lian et al, only examined the 
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expression level of a few proteins/RNAs in response to LPS stimulation. Fessler et al also 
concede that, since they used the Coomassie Blue dye staining method, which is known to have a 
limited protein binding range and a non-linear curve for protein detection, the resulting image 
analysis of the Coomassie Blue-stained proteins ought to be considered as semi-quantitative only 
(see page 31301, col. 1). Further, Fessler et al submit that protein identification in their study 
was done using two-dimensional PAGE but admit that the analysis was limited only to well- 
resolved regions of the gel , which Fessler et al explicitly concede, tends to select for more 
abundant protein species and therefore, may have performed less well with hydrophobic and high 
molecular weight proteins (see page 31301, col. 1). In addition, the Fessler et al paper also 
indicates that the harvesting of the LPS-incubated PMNs at 4 hours may have prevented the 
detection of early, transiently appearing proteins, and that the process of post-LPS incubation 
and pre-two-dimensional PAGE cell washes would expectedly remove secreted proteins from 
further analysis, perhaps contributing to the observed transcript-protein discordance. Therefore, 
the Fessler et al reference explains the reasons for their transcript-protein discordance and like 
the Lian et al reference, cannot be relied upon to make a general proposition that protein levels 
cannot be accurately predicted from mRNA levels. 

Therefore, when the proper legal standard is used, a prima facie case of lack of utility has 
not been met based on the cited references Lian et al and Fessler et al Indeed, the working 
hypothesis among those skilled in the art is that, if a gene is amplified in cancer, the encoded 
protein is likely to be expressed at an elevated level. 

On the contrary, Appellants submit that gene amplification assay in the specification 
further discloses that, "(a)mplification is associated with overexpression of the gene product, 
indicating that the polypeptides are useful targets for therapeutic intervention in certain cancers 
such as colon, lung, breast and other cancers and diagnostic determination of the presence of 
those cancers" (emphasis added). Besides, Appellants have submitted ample evidence 
(discussed below) to show that, in general, if a gene is amplified in cancer, it is "more likely than 
not" likely that the encoded protein will also be expressed at an elevated level. 

For support, Appellants presented the articles by Orntoft et al, Hyman et al, and Pollack 
et al (made of record in Appellants' Response filed August 9, 2004), who collectively teach that 
in general for most genes, DNA amplification increases mRNA expression . The results 
presented by Orntoft et al, Hyman et al, and Pollack et al are based upon wide ranging 
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analyses of a large number of tumor associated genes. Orntoft et al studied transcript levels of 
5600 genes in malignant bladder cancers, many of which were linked to the gain or loss of 
chromosomal material, and found that in general (18 of 23 cases) chromosomal areas with more 
than 2-fold gain of DNA showed a corresponding increase in mRNA transcripts. Hyman et al 
compared DNA copy numbers and mRNA expression of over 12,000 genes in breast cancer 
tumors and cell lines, and found that there was evidence of a prominent global influence of copy 
number changes on gene expression levels. In Pollack et al, the authors profiled DNA copy 
number alteration across 6,691 mapped human genes in 44 predominantly advanced primary 
breast tumors and 10 breast cancer cell lines, and found that on average, a 2-fold change in DNA 
copy number was associated with a corresponding 1.5-fold change in mRNA levels. In summary, 
the evidence supports the Appellants' position that gene amplification is more likely than not 
predictive of increased mRNA and polypeptide levels. 

Second, the Declaration of Dr. Paul Polakis (made of record in Appellants' Response 
filed August 9, 2004), principal investigator of the Tumor Antigen Project of Genentech, Inc., 
the assignee of the present application, explains that in the course of Dr. Polakis' research using 
microarray analysis, he and his co-workers identified approximately 200 gene transcripts that are 
present in human tumor cells at significantly higher levels than in corresponding normal human 
cells. Appellants submit that Dr. Polakis' Declaration was presented to support the position that 
there is a correlation between mRNA levels and polypeptide levels, the correlation between gene 
amplification and mRNA levels having already been established by the data shown in the Orntoft 
et al, Hyman et al, and Pollack et al articles. Appellants further emphasize that the opinions 
expressed in the Polakis Declaration, including in the above quoted statement, are all based on 
factual findings. For instance, antibodies binding to about 30 of these tumor antigens were 
prepared, and mRNA and protein levels were compared. In approximately 80% of the cases , the 
researchers found that increases in the level of a particular mRNA correlated with changes in the 
level of protein expressed from that mRNA when human tumor cells are compared with their 
corresponding normal cells . Therefore, Dr. Polakis' research, which is referenced in his 
Declaration, shows that, in general there is a correlation between increased mRNA and 
polypeptide levels . 

Appellants further note that the sale of gene expression chips to measure mRNA levels is 
a highly successful business, with a company such as Affymetrix recording 168.3 million dollars 
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in sales of their GeneChip® arrays in 2004. Clearly, the research community believe that the 
information obtained from these chips is useful (i.e., that it is more likely than not that the results 
are informative of protein levels). 

Taken together, all of the submitted evidence supports the Appellants' position that, in 
the majority of amplified genes , increased gene amplification levels, more likely than not, predict 
increased mRNA and polypeptide levels, which clearly meets the utility standards described 
above. Hence, one of skill in the art would reasonably expect that, based on the gene 
amplification data of the PR0232 gene, the PR0232 polypeptide is concomitantly overexpressed 
in the lung or colon tumors studied as well. 

Appellants further submit that, even if there were no correlation between gene 
amplification and increased mRNA/protein expression, (which Appellants expressly do not 
concede), a polypeptide encoded by an amplified gene in cancer would still have a specific, 
substantial, and credible utility as explained below. As the Declaration of Dr. Avi Ashkenazi 
(submitted with Appellants' Response filed December 10, 2003) explains: 

"even when amplification of a cancer marker gene does not result in significant over- 
expression of the corresponding gene product, this very absence of gene product over- 
expression still provides significant information for cancer diagnosis and treatment." 

Thus, even if over-expression of the gene product does not parallel gene amplification in 
certain tumor types, parallel monitoring of gene amplification and gene product over-expression 
enables more accurate tumor classification and hence better determination of suitable therapy. In 
addition, absence of over-expression is crucial information for the practicing clinician. If a gene 
is amplified in a tumor, but the corresponding gene product is not over-expressed, the clinician 
will decide not to treat a patient with agents that target that gene product. This not only saves 
money, but also has the benefit that the patient can avoid exposure to the side effects associated 
with such agents. 

This utility is further supported by the teachings of the article by Hanna and Mornin. 

(Pathology Associates Medical Laboratories, August (1999), submitted with the Response filed 

August 9, 2004). The article teaches that the HER-2/neu gene has been shown to be amplified 

and/or over-expressed in 10%-30% of invasive breast cancers and in 40%-60% of intraductal 

breast carcinomas. Further, the article teaches that diagnosis of breast cancer includes testing 

both the amplification of the HER-2/neu gene (by FISH) as well as the over-expression of the 
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HER-2/neu gene product (by IHC). Even when the protein is not over-expressed, the assay 
relying on both tests leads to a more accurate classification of the cancer and a more effective 
treatment of it. 

Thus, based on the asserted utility for PR0232 in the diagnosis of selected lung or colon 
tumors, the reduction to practice of the instantly claimed protein sequence of SEQ ID NO: 18 in 
the present application (also see pages 3-4 and Example 4, page 149-150), the step-by- step 
preparation of chimeric PRO polypeptides, including those wherein the heterologous polypeptide 
is an epitope tag or an Fc region of an immunoglobulin (page 74, lines 23 to page 75, line 5), the 
description of the expression of PRO polypeptides in various host cells, including E. coli, 
mammalian cells, yeast and Baculovirus-infected insect cells at least in Examples 53-56, pages 
192-199, the disclosure of the step-by-step protocol for the preparation, isolation and detection of 
monoclonal, polyclonal and other types of antibodies against the PR0232 protein in the 
specification (monoclonal and polyclonal antibodies at page 139, line 32, to page 141, line 13; 
humanized antibodies at page 141, line 15, to page 142, line 16; antibody fragments at page 143, 
line 8 onwards; labeled antibodies at pages 144-145, line 16 onwards and page 146, line 33 to 
page 147, line 6) and the disclosure of the gene amplification assay in Example 92, the skilled 
artisan would know exactly how to make and use the claimed polypeptides for the diagnosis of 
lung or colon cancers. Appellants submit that based on the detailed information presented in the 
specification and the advanced state of the art in oncology, the skilled artisan would have found 
such testing routine and not 'undue.' 

Therefore, since the instantly claimed invention is supported by either a credible, specific 
and substantial asserted utility or a well-established utility, one skilled in the art would know 
"how to make and use" the claimed invention without undue experimentation, Appellants 
respectfully request reconsideration and reversal of the determination of priority for Claims 44- 
4'6 and 49-51. 

ISSUE 2. Claims 44-46 and 49-51 should be entitled to the priority date of U.S. Provisional 
Application 60/059121, filed September 17, 1997 

The instant application has not been granted the earlier priority date of U.S. Provisional 
Application 60/059121, filed September 17, 1997 on the grounds that the prior 60/059121 
application fails to provide a utility and lacks an enabling disclosure for the claimed invention 
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under 35 U.S.C.§§ 101/1 12, first paragraph." (Page 14 of the Final Office Action mailed 
October 21, 2005). Appellants disagree and submit that, for the same reasons discussed above 
under Issue 1, U.S. Provisional Application 60/059121 also satisfies the utility requirements. 
Therefore, Appellants should be entitled to the priority date of September 17, 1997. 

ISSUE 3. Claims 44-46 and 49-51 are not anticipated by Rosenthal et aL DE19818619-A1 
(dated 10/1999) 

Claims 44-46 and 49-51 remain rejected under 35 U.S.C. § 102(b) as being anticipated by 
Rosenthal et ai, DE19818619-A1 (dated 10/1999). 

For the reasons discussed above under Issue 2, Appellants maintain that they are entitled 
to an effective filing date of September 17, 1997 based on a properly claimed priority to 
International application PCTYUS98/18824. Therefore, Rosenthal et al is not prior art and does 
not anticipate the instant claims. Accordingly, this rejection under 35 U.S.C. § 102(b) should be 
withdrawn. 

CONCLUSION 

For the reasons given above, Appellants submit that present specification and the 
specification of U.S. Provisional Application 60/059121 dated September 17, 1997 clearly 
describes and provides at least one patentable utility for the instantly claimed invention. 
Moreover, it is respectfully submitted that the present specification clearly teaches "how to use" 
the presently claimed polypeptide based upon this disclosed patentable utility. Accordingly, 
Rosenthal et al 9 DE19818619-A1 is not prior art. As such, Appellants respectfully request 
reconsideration and reversal of the outstanding rejection of claims 44-46 and 49-51. 
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The Commissioner is authorized to charge any fees which may be required, including 
extension fees, or credit any overpayment to Deposit Account No. 08-1641 (referencing 
Attorney's Docket No. 39780-1618 P2C18 . 
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IX. CLAIMS APPENDIX 
Claims on Appeal 



44. An isolated polypeptide comprising: 

(a) the amino acid sequence of the polypeptide of SEQ ID NO: 1 8; 

(b) the amino acid sequence of the polypeptide of SEQ ID NO: 18, lacking its 
associated signal peptide; or 

(c) the amino acid sequence of the polypeptide encoded by the full-length coding 
sequence of the cDNA deposited under ATCC accession number 209250; 

wherein, the nucleic acid encoding said polypeptide is amplified in lung or colon tumors. 

45. The isolated polypeptide of Claim 44 comprising the amino acid sequence of the 
polypeptide of SEQ ID NO: 18. 

46. The isolated polypeptide of Claim 44 comprising the amino acid sequence of the 
polypeptide of SEQ ID NO: 18, lacking its associated signal peptide. 

49. The isolated polypeptide of Claim 44 comprising the amino acid sequence of the 
polypeptide encoded by the full-length coding sequence of the cDNA deposited under ATCC 
accession number 209250. 

50. A chimeric polypeptide comprising a polypeptide according to Claim 44 fused to 
a heterologous polypeptide. 

5 1 . The chimeric polypeptide of Claim 50, wherein said heterologous polypeptide is 
an epitope tag or an Fc region of an immunoglobulin. 
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X. EVIDENCE APPENDIX 

1. Declaration of Avi Ashkenazi, Ph.D. under 35 C.F.R. 1.132, with attached Exhibit A 
(Curriculum Vitae). 

2. Declaration of Paul Polakis, Ph.D. under 35 C.F.R. 1.132. 

3. Orntoft, T.F., et al., "Genome-wide Study of Gene Copy Numbers, Transcripts, and 
Protein Levels in Pairs of Non-Invasive and Invasive Human Transitional Cell Carcinomas," 
Molecular & Cellular Proteomics 1:37-45 (2002). 
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Medical Laboratories (1999). 

7. Declaration of Audrey Goddard, Ph.D. under 35 C.F.R. §1.132, with attached Exhibits A-G: 
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sequences," Biotechnology 10:413-417 (1992). 

C. Livak, K.J., et al., "Oligonucleotides with fluorescent dyes at opposite ends 
provide a quenched probe system useful for detecting PCR product and nucleic acid 
hybridization," PCR Methods Appl. 4:357-362 (1995). 

D. Heid, C.A. et al., "Real time quantitative PCR," Genome Res. 6:986-994 (1996). 

E. Pennica, D. et al., "WISP genes are members of the connective tissue growth 
factor family that are up-regulated in Wnt-1 -transformed cells and aberrantly expressed 
in human colon tumors " Proc. Natl Acad Sci. USA 95:14717-14722 (1998). 

F. Pitti, R.M. et al., "Genomic amplification of a decoy receptor for Fas ligand in 
lung and colon cancer," Nature 396:699-703 (1998). 

G. Bieche, I, et al., "Novel approach to quantitative polymerase chain reaction using 
real-time detection: Application to the detection of gene amplification in breast cancer," 
Int. J. Cancer 78:661-666 (1998). 
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8. Haynes et aL, "Proteome analysis: Biological assay or data archive?" Electrophoresis 
19:1862-1871 (1996). 

9. Pennica, D. et al, "WISP genes are members of the connective tissue growth factor 
family that are up-regulated in Wnt-1 -transformed cells and aberrantly expressed in 
human colon tumors/' Proc. Natl Acad. Set USA 95:14717-14722 (1998). 
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11. Hu et al, "Analysis of genomic and proteomic data using advanced literature mining," 
Proteome Res. 2: 405-412 (2003). 

12. Konopka et al, "Variable Expression of the Translocated c-abl oncogene in Philadelphia- 
chromosome-positive B-lymphoid cell lines from chronic myelogenous leukemia patients" Proc. 
Natl Acad. Set USA 83: 4049-52, (1986). 

13 . Lian et al, "Genomic and proteomic analysis of the myeloid differentiation program," 
Blood 98: 513-524 (2001). 

14. Fessler et al, "A genomic and proteomic analysis of activation of the human neutrophil 
by lipopolysaccharide and its mediation by p38 mitogen-activated protein kinase," J. Biol Chem. 
277:31291-31302 (2002). 

Item 1 was submitted with Appellants' Response filed December 10, 2003, and was considered 
by the Examiner as indicated in the Office action mailed April 8, 2004. 

Items 2-6 were submitted with Appellants' Response filed August 9, 2004, and were considered 
by the Examiner as indicated in the Office action mailed December 6, 2004. 

Item 7 was submitted with Appellants' Response filed July 25, 2005, and were considered by the 
Examiner as indicated in the second Final Office action mailed October 21, 2005. 

Items 8-10 were made of record by the Examiner in the Office Action mailed September 29, 
2003. 



Items 11-12 were made of record by the Examiner in the Office Action mailed February 23, 
2005. 
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Items 13-14 were made of record by the Examiner in the Final Office Action mailed October 21, 
2005. 
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XI. RELATED PROCEEDINGS APPENDIX 

None- no decision rendered by a Court or the Board in any related proceedings identified 

above. 
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I, Avi Ashkenazi, Ph.D. declare and say as follows: - 

1 . I am Director and Staff Scientist at the Molecular Oncology Department of 
Genentech, Inc., South San Francisco, CA 94080. 

2. I joined Genentech in 1988 as a postdoctoral fellow. Since then, I have 
investigated a variety of cellular signal transduction mechanisms, including apoptosis, and have 
developed technologies to modulate such mechanisms as a means of therapeutic intervention in 
cancer and autoimmune disease. I am currently involved in the investigation of a series of 
secreted proteins over-expressed in tumors, with the aim to identify useful targets for the 
development of therapeutic antibodies for cancer treatment. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to 
and forms part of this Declaration (Exhibit A). 

4. Gene amplification is a process in which chromosomes undergo changes to 
contain multiple copies of certain genes that normally exist as a single copy, and is an important 
factor in the pathophysiology of cancer. Amplification of certain genes (e.g., Myc or Her2/Neu) 



gives cancer cells a growth or survival advantage relative to normal cells, and might also provide 
a mechanism of tumor cell resistance to chemotherapy or radiotherapy. 

5. If gene amplification results in over-expression of the mRNA and the 
corresponding gene product, then it identifies that gene product as a promising target for cancer 
therapy, for example by the therapeutic antibody approach. Even in the absence of over- 
expression of the gene product, amplification of a cancer marker gene - as detected, for example, 
by the reverse transcriptase TaqMan® PCR or the fluorescence in situ hybridization (FISH) 
assays -is useful in the diagnosis or classification of cancer, or in predicting or monitoring the 
efficacy of cancer therapy. An increase in gene copy number can result not only from 
intrachromosomal changes but also from chromosomal aneuploidy. It is important to understand 
that detection of gene amplification can be used for cancer diagnosis even if the determination 
includes measurement of chromosomal aneuploidy. Indeed, as long as a significant difference 
relative to normal tissue is detected, it is irrelevant if the signal originates from an increase in the 
number of gene copies per chromosome and/or an abnormal number of chromosomes. 

6. I understand that according to the Patent Office, absent data demonstrating that 
the increased copy number of a gene in certain types of cancer leads to increased expression of 
its product, gene amplification data are insufficient to provide substantial utility or well 
established utility for the gene product (the encoded polypeptide), or an antibody specifically 
binding the encoded polypeptide. However, even when amplification of a cancer marker gene 
does not result in significant over-expression of the corresponding gene product, this very 
absence of gene product over-expression still provides significant information for cancer 
diagnosis and treatment. Thus, if over-expression of the gene product does not parallel gene 
amplification in certain tumor types but does so in others, then parallel monitoring of gene 
amplification arid gene product over-expression enables more accurate tumor classification and 
hence better determination of suitable therapy. In addition, absence of over-expression is crucial 
information for the practicing clinician. If a gene is amplified but the corresponding gene 
product is not over-expressed, the clinician accordingly will decide not to treat a patient with 
agents that target that gene product. 

7. I hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information or belief are believed to be true, and further that 
these statements were made with the knowledge that willful false statements and the like so 




made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code and that such willful statements may jeopardize the validity of the 
application or any patent issued thereon. 

By: A^A*M^~x^ Date: g/i^fcy 

Avi Ashkenazi, Ph.D. * 



SV 455281 vl 

9/12/03 3:06 PM (39780.7000) 




CURRICULUM VITAE 
Avi Ashkenazi 

July 2003 



Personal: 

Date of birth: 29 November, 1956 

Address: 1456 Tarrytown Street, San Mateo, CA 94402 

Phone: (650) 578-9199 (home); (650) 225-1853 (office) 

Fax: (650) 225-6443 (office) 

Email: aa@gene.cdm 

Education: 

1983: B.S. in Biochemistry, with honors, Hebrew University, Israel 

1 986: Ph.D. in Biochemistry, Hebrew University, Israel 



Employment: 

1983-1986: 

1985- 1986: 

1986- 1988: 

1988- 1989: 

1989 - 1993: 
1994-1996: 

1996- 1997: 

1997- 1990: 
1999 -2002: 
2002-present: 



Teaching assistant, undergraduate level course in Biochemistry 
Teaching assistant, graduate level course on Sigual Transduction 
Postdoctoral fellow, Hormone Research Dept., UCSF, and 
Developmental Biology Dept., Genentech, Inc., with J. Ramachandran 
Postdoctoral fellow, Molecular Biology Dept., Genentech, Inc., 
with D. Capon 

Scientist, Molecular Biology Dept., Genentech, Inc. 
Senior Scientist, Molecular Oncology Dept., Genentech, Inc. 
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Editorial: 

Editorial Board Member: Current Biology 
Associate Editor, Clinical Cancer Research, 
Associate Editor, Cancer Biology and Therapy. 

Refereed papers: 

1 . Gertler, A., Ashkenazt A;, and Madar, Z. Binding sites for human growth 
hormone and ovine and bovine prolactins in the mammary gland and liver of the 
lactating cow. Mol Cell Endocrinol 34, 51^57 (1984). 

2. Gertler, A., Shamay, A., Cohen, N., Ashkenazl A., Friesen, H., Levanon, A., 
Gorecki, M., Aviv, H., Hadari, D., and Vogel, T. Inhibition of lactogenic 
activities of ovine prolactin and human growth hormone (hGH) by a novel form of 
a modified recombinant hGH. Endocrinology 118, 720-726 (1986). 

3. Ashkenazu A., Madar, Z., and Gertler, A. Partial purification and characterization 
of bovine mammary gland prolactin receptor. Mol Cell Endocrinol 50, 79-87 
(1987). 

4. Ashkenazi, Ay, Pines, M., and Gertler, A. Down-regulation of lactogenic 
hormone receptors in Nb2 lymphoma cells by cholera toxin. Biochemistry 
Internatl 14, 1065-1072 (1987). 

5. Ashkenazu A., Cohen, R., and Gertler, A. Characterization of lactogen receptors 
in lactogenic hormone-dependent and independent Nb2 lymphoma cell lines. 
FEBSLett 210, 51-55 (1987). 

6. Ashkenazi. A., Vogel T., Barash, I., Hadari, P., Levanon, A., Gorecki, M., and 
Gertler, A. Comparative study on in vitro and in vivo modulation of lactogenic 
and somatotropic receptors by native human growth hormone and its modified 
recombinant analog. Endocrinology 121, 414-419 (1987). 

7. Peralta, E., Winslow, J., Peterson, G., Smith, D., Ashkenazu A., Ramachandran, 
J., Schimerlik, M. 9 and Capon, D. Primary structure and biochemical properties 
of an M2 muscarinic receptor. Science 236, 600-605 (1987). 

8. Peralta, E. AshkenazL A.. Winslow, J., Smith, D., Ramachandran, J., and Capon, 
D. J. Distincrit primary structures, ligand-binding properties and tissue-specific 
expression of four human muscarinic acetylcholine receptors. EMBO J. 6, 3923- 
3929(1987). 

9. Ashkenazu A., Winslow, J., Peralta, E., Peterson, G., Schimerlik, M., Capon, D., 
and Ramachandran, J. An M2 muscarinic receptor subtype coupled to both 
adenylyl cyclase and phosphoinositide turnover. Science 238, 672-675 (1987). 
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10. Pines, M., Ashkenazi. A.. Cohen-Chapnik, N., Binder, L., and Gertler, A. 
Inhibition of the proliferation of Nb2 lymphoma cells by femtomolar 
concentrations of cholera toxin and partial reversal of the effect by 12-o- 
tetradecanoyl-phorbol-13-acetate./. Cell. Biochem. 37, 119-129(1988). 

11. Peralta. E. Ashkenazi. A.. Winslow. J. Ramachandran, J., and Capon, D. 
Differential regulation of PI hydrolysis and adenylyl cyclase by muscarinic 
receptor subtypes. Nature 334, 434-437 (1988). 

1 2. Ashkenazi.. A. Peralta, E., Winslow, J., Ramachandran, J., and Capon, D. 
Functionally distinct G proteins couple different receptors to PI hydrolysis in the 
same cell. Cell 56, 487-493 (1989). 

13. Ashkenazi. A.. Ramachandran. J., and Capon, D. Acetylcholine analogue 
stimulates DNA synthesis in brain-derived cells via specific muscarinic 
acetylcholine receptor subtypes. Nature 340, 146-150 (1989). 

14. Lammare. D;. Ashkenazi. A.. Fleurv. S.. Smith, P., Sekaly, R., and Capon, D. 
The MHC-bmding and gpl20-binding domains of CD4 are distinct and separable. 
Science 245, 743-745 (1989). 

15. Ashkenazi.. A.. Presta, L., Marsters, S., Camerato, T., Rosenthal, K., Fendly, B., 
and Capon, D. Mapping the CD4 binding site for human irnmuhodefficiency 
virus type 1 by alanine-scanning mutagenesis. Proc. Natl. Acad. Sci. USA. 87, 
7150-7154(1990). 

16. Chamow, S., Peers, D., Byrn, R., Mulkerrin, M., Harris, R., Wang, W., Bjorkman, 
P , Capon, D. , and Ashkenazi. A. Enzymatic cleavage of a CD4 immunoadhesin 
generates crystallizable, biologically active Fd-like fragments. Biochemistry 29, 
9885-9891 (1990). 

17. Ashkenazi. A;. Smith, D., Marsters, S., Riddle, L., Gregory, T., Ho, D., and 
Capon, D. Resistance of primary isolates of human immunodefficiehcy virus type 
1 to soluble CD4 is independent of CD4-rgpl20 binding affinity. Proc. Natl. 
Acad: Sci. USA. 88, 7056-7060 (1991). 

18. Ashkenazi. A.. Marsters, S., Capon, D., Chamow, S., Figari., L, Pennica, D., 
Goeddel, D., Palladino, M., and Smith, D. Protection against ertdotoxic shock by 
a tumor necrosis factor receptor immunoadhesin. Proc. Natl. Acad. Sci. USA. 88, 
10535-10539(1991). 

19. Moore J.. McKeating. J.. Huang. Y.. Ashkenazi. A ., and Ho. D. Virions of 
primary HTV-1 isolates resistant to sCD4 neutralization differ in sCD4 affinity and 
glycoprotein gpl20 retention from sCD4-sensitive isolates. /. Virol 66, 235-243 
(1992). 
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20. Jin, H., Oksenberg, D., Ashkenazi, A., Peroutka, S., Duncan, A., RozmaheL, R., 
Yang, Y., Mengod, G., Palacios, J., and ODowd, B. Characterization of the 
human 5-hydroxytryptamineiB receptor. J. Biol Chem. 267, 5735-5738 (1992). 

21. Marsters, A., Frutkin, A., Simpson, N., Fendly, B. and Ashkenazi. A. 
Identification of cysteine-rich domains of the type 1 tumor necrosis receptor 
involved in ligarid binding. J. Biol Chem. 267, 5747-5750 (1992). 

22. Chamow, S., Kogaii, T., Peers, D., Hastings, R., Byrn, R., and Ashkenazi, A. 
Conjugation of sCD4 without loss of biological activity via a novel carbohydrate- 
directed cross-linking reagent. J.Biol Chem. 267, 15916-15922 (1992). 

23. Oksenberg, D., Marsters, A., ODowd, B., Jin, H., Havlik, S., Peroutka, S., and 
Ashkenazi, A. A single amino-acid difference confers major pharmacologic 
variation between human and rodent 5-HTib receptors. Nature 360, 161-163 

(1992) . 

■24. Haak-Frendscho, M., Marsters, S., Chamow, S., Peers, D., Simpson, N., and 
Ashkenazt A. Inhibition of interferon y by an interferon y receptor 
immunoadhesin. Immunology 79, 594-599 (1993). 

25. Penica, D., Laim, V , Weber, R., Kohr, W., Basa, L., Spellmah, M., Ashkenazi, 
Shire, S., and Gbeddel, D. Biochemical characterization of the extracellular 
domain of the 75-kd tumor necrosis factor receptor. Biochemistry 32,3131-3138. 

(1993) . 

26. Barfod, L., Zheng, Y., Kuang, W., Hart, M., Evans, T., Cerione, R., and 
Ashkenazi, A. Cloning and expression of a human CDC42 GTPase Activating 
Protein reveals a functional SH3-binding domain. J. Biol Chem. 268, 26059- 
26062(1993). 

27. Chamow, S, Zhang, D., Tan, X., Mhtre, S., Marsters, S., Peers, D., Byrn, R., 
Ashkenazi, A., and Yunghans, R. A humanized bispecific immunoadhesin- 
antibody that retargets CD3+ effectors to kill HIV-1 -infected cells. J. Immunol 
153, 4268-4280 (1994), 

28. Means, R., Krantz, S., Luna, J., Marsters, S., and AshkenazL A. Inhibition of 
murine erythroid colony formation in vitro by iterferon y and correction by 
interferon y receptor immunoadhesin. Blood 83, 91 1-915 (1994). 

29. Haak-Frendscho, M., Marsters, S., Mordenti, J., Gillet, N., Chen, S., 
andAshkenazi, A. Inhibition of TNF by a TNF receptor immunoadhesin: 
comparison with an anti-TNF mAb. J. Immunol 152, 1347-1353 (1994). 
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30. Chamow, S., Kogan, T., Venuti, M., Gadek, T., Peers, D., Mordenti, J., Shak, S., 
and AshkenazL A. Modification of CD4 immunoadhesin with monomethoxy- 
PEG aldehyde via reductive alkilation. Bioconj. Cherru 5, 133-140 (1994). 

31. Jin, H., Yang, R., Marsters, S., Bunting, S., Wurm, R, Chamow, S., and 
Ashkenazi, A. Protection against rat endotoxic shock by p55 tumor necrosis factor 
(TNF) receptor immunoadhesin: comparison to anti-TNF monoclonal antibody. J. 
Infect Diseases 170, 1323-1326 (1994). 

32. Beck, J., Marsters, S., Harris, R., Ashkenazi. A., and Chamow, S. Generation of 
soluble iriterleukin-1 receptor from an immunoadhesin by specific cleavage. Mol 
/mmw/ic?7. 31, 1335-1344(1994). 

33. Pitti, B., Marsters, M., Haak-Frendscho, M., Osaka, G., Mordenti, J., Chamow, S., 
and Ashkenazi. A. Molecular and biological properties of an interleukin-1 
receptor immunoadhesin. Mol Immunol 31, 1345-1351 (1994). 

34. Oksenberg, D , Havlik, S., Peroutka, S., and Ashkenazi. A. The third intracellular 
loop of the 5-HT2 receptor specifies effector coupling. J. Neurochenu 64, 1440- 
1447(1995). 

35. Bach, E., Szabo, S., Dighe, A., Ashkenazi. A.. Aguet, M., Murphy, K., and 
Schreiber, R. Ligarid-induced autoregulation of IFN-y receptor p chain expression 
in T helper cell subsets. Science 270, 1215-1218 (1995). 

36. Jin, EL, Yang, R, Marsters, S,, Ashkenazi, A., Bunting, S., Marra, M., Scott, R., 
and Baker, J. Protection against endotoxic shock by bactericidal/permeability- 
increasing protein in rats. J. Clin. Invest, 95, 1947-1952 (1995). 

37. Marsters, S., Penica, D., Bach, R, Schreiber, R., and Ashkenazi, A. Interferon y 
signals via a high-affinity multisubunit receptor complex that contains two types 
of polypeptide chain. Proc. Natl Acad. Set USA. 92, 5401-5405 (1995). 

38. Van Zee, K., Moldawer, L., Oldenburg, H., Thompson, W., Stackpole, S., 
Mdntegut, W., Rogy, M., Meschter, C, Gallati, H., Schiller, C, Richter, W., 
Loetcher, H., Ashkenazi, A ., Chamow, S., Wurm, F., Calvano, S., Lowry, S., and 
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Immunol 156, 2221-2230 (1996). 

39. Pitti, R, Marsters, S., Ruppert, S., Donahue, C, Moore, A., and Ashkenazi. A . 
Induction of apbptosis by Apo-2 Ligand, a new member of the tumor necrosis 
factor cytokine family. J. Biol Chem. 271, 12687-12690 (1996). 
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Activation of apoptosis by Apo-2 ligand is independent of F ADD but blocked by 
CrmA. Curr. Biol 6, 1669-1676 (1996). 
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42. Sheridan, J., Marsters, S., Pitti, R., Gumey, A., Skubatch, M., Baldwin, D., 
Ramakrishhari, L., Gray, C, Baker, K., Wood, W.I., Goddard, A,, Godowski, P., and 
Ashkenazi, A. Control of TRAIL-induced apoptosis by a family of signaling and 
decoy receptors. Science !!!, 818-821 (1997). 

43. Marsters, S., Sheridan, J., Pitti, R., Gumey, A., Skubatch, M., Balswin, D., Huang, A., 
Yuan, J., Goddard, A., Godowski, P., and Ashkenazi. A. A novel receptor for 
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identification of a ligand for the death-domain-containing receptor Apo3. Curr. Biol 
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a novel weapon against malignant glioma? FEBS Lett 427, 124-128 (1998). 
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receptor immunoadhesin prevents T cell mediated intestinal injury by inhibiting 
matrix metalloproteinase production, 7. Immunol 160, 4098-4103 (1998). 
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48. Mori, S., Marakami-Mori, K., Nakamura, S., Ashkenazi, A ., and Bonavida, B. 
Sensitization of AIDS Kaposi's sarcoma cells to Apo-2 ligand-induced apoptosis 
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Godowski, P., and AshkenazL A. Identification of a new member of the tumor 
necrosis factor family and its receptor, a human ortholog of mouse GFTR. Curr. 
Biol 9, 215-218 (1999). 
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50. Ashkenazl A ., Pai, R., Fong, s., Leung, S., Lawrence, D., Marsters, S., Blackie, 
C, Chang, L., McMurtrey, A., Hebert, A., DeForge, L., Khoumenis, L, Lewis, D., 
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anti-tumor activity of recombinant soluble Apo2 ligand J. Clin. Invest 104, 155- 
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51. Chimtharapai, A., Gibbs, V., Lu, L, Ow, A., Marsters, S., Ashkenazi, A., De Vos, 
A., Kim, KJ. Determination of residues involved in ligand binding and signal 
transmission in the human IFN-a receptor 2. J. Immunol 163, 766-773 (1999). 

52. Johnsen, A.-C, Haux, J., Steinkjer, B., Nonstad, U., Egeberg, K., Sundan, A., 
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cells - involvement in NK cell-mediated cytotoxicity. Cytokine 11, 664-672 
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53. Roth, W., Isenmann, S., Naumann, U., Kugler, S., Bahr, M., Dichgans, 
Ashkenazi. A., and Weller, M. Eradication of intracranial human malignant 
glioma xenografts by Apo2L/TRAIL. Biochem. Biophys. Res. Commun. 265, 479- 
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54. Hymowitz, S.G., Christinger, H.W., Fuh, G., Ultsch, M., O'Connell, M., Kelley, 
R.F. , Ashkenazi, A. and de Vos, A.M. Triggering Cell Death: The Crystal 
Structure of Apo2L/TRAIL in a Complex with Death Receptor 5. Molec. Cell 4, 
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55. Hymowitz, S.G., O'Connel, M.P., Utsch, M.H., Hurst, A., Totpal, K., Ashkenazi. 
A;, de Vos, A.M., Kelley, R.F. A unique zinc-binding site revealed by a high- 
resolution X-ray structure of homotrimeric Apo2L/TRAIL. Biochemistry 39 ; 633- 
640(2000). 

56. Zhou, Q., Fukushima, P., DeGraff, W., Mitchell, J.B., Stetler-Stevenson, M, 
Ashkenazi, A., and Steeg, P.S. Radiation and the Apo2L/T RAIL apoptotic 
pathway preferentially inhibit the colonization of premalignant human breast 
cancer cells overexpressing cyclin Dl. Cancer Res. 60, 261 1-2615 (2000), 

57. Kischkel, F.C., Lawrence, D. A., Chuntharapai, A., Schow, P., Kim, J., and 
Ashkenazi, A. Apo2I/TRAIL-dependent recruitment of endogenous FADD and 
Caspase-8 to death receptors 4 and 5 . Immunity 12,611 -620 (2000). 

58. Yan, M., Marsters, S.A., Grewal, I.S., Wang, H., * Ashkenazi. A., and *Dixit, 
V.M. Identification of a receptor for BlyS demonstrates a crucial role in humoral 
immunity. Nature Immunol 1, 37-41 (2000). 
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59. Marsters, S.A., Yan, M., Pitti, R.M., Haas, P.E., Dixit, V.M., and AshkenazL A. 
Interaction of the TNF homologies BLyS and APRIL with the TNF receptor 
homologies BCMA and TACI. Curr. Biol. 10, 785-788 (2000). 

60. Kischkel, F.C., and Ashkenazu A . Combining enhanced metabolic labeling with 
immunoblotting to detect interactions of endogenous cellular proteins. 
Biotechniques 29, 506-512 (2000). 

61". Lawrence, D., Shahrokh, Z., Marsters, S., Achilles, K., Shih, D. Mounho, B., 

Hillan, K , Tdtpal, K. DeForge, L., Schow, P., Hooley, J., Sherwood, S., Pai, R., 
Leung, S., Khan, L., Gliniak, B., Bussiere, J., Smith, C, Strom, S., Kelley, S., 
Fox, J., Thomas, P., and Ashkenazi. A. Differential hepatocyte toxicity of 
recombinant Apo2L/TRAIL versions. Nature Med. 7, 383-385 (2001). 

62. Chuntharapai, A., Dodge, K., Grimmer, K., Schroeder, K., Martsters, S.A., 
Koeppen, H., Ashkenazi; A ., and Kim. KJ. Isotype-dependent inhibition of 
tumor growth in vivo by monoclonal antibodies to death receptor 4. J. Immunol. 
166, 4891-4898 (2001). 

63. Pollack, I.F., Erff, M., and Ashkenazi/A . Direct stimulation of apoptotic 
signaling by soluble Apo2L/tumor necrosis factor-related apoptosis-inducing 
ligand leads to selective killing of glioma cells. Clin. Cancer Res. 7, 1362-1369 
(2001). 

64. Wang, H., Marsters, S. A., Baker, T., Chan, B., Lee, W.P., Fu, L., Tumas, D., Yari, 
M, Dixit, V.M., * AshkenazL A ., and *Grewal, LS. TACI-ligand interactions are 
required for T cell activation and collagen-induced arthritis in mice. Nature 
Immunol. 2, 632-637 (2001). 

65. Kischkel, F.C., Lawreirice, D. A., Tinel, A., Virmani, A., Schow, P., Gazdar, A., 
Blenis, J., Arnott, p., arid Ashkenazi. A . Death receptor recruitment of 
endogenous caspase- 10 and apoptosis initiation in the absence of caspase-8. J. 
Biol Chern. 276, 46639-46646 (2001). 

66. LeBlanc, H., Lawrence, D.A., Varfolomeev, E., Totpal, K., Morian, J., Schow, P.; 
Fong, S., Schwall, R., Sinicropi, D., and Ashkenazi. A T umor cell resistance to 
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67. Miller, K., Meng, G., Liu, J., Hurst, A., Hsei, V., Wong, W-L., Ekert, R., 
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Review articles: 

1 . AshkenazL A., Peralta, E., Winslow, J., Ramachandran, J., and Cappn, D., J. 
Functional role of muscarinic acetylcholine receptor subtype diversity. Cold 
Spring Harbor Symposium on Quantitative Biology. Lm, 263-272 (1988). 

2. AshkenazL A .. Peralta, IL Winslow. J. T Ramachandran, J. T and Capon, D. 
Functional diversity of muscarinic receptor subtypes in cellular signal 
transduction and growth. Trends Pharmacol Scu Dec Supplement, 12-21 (1989). 

3. Chamow, S., Duliege, A., Ammann, A., Kahn, J., Allen, D,, Eichberg, J., Byrn, 
R., Capon, D;,' Ward, R., and AshkenazL A . CD4 immunoadhesins in anti-HtV 
therapy: new developments. Int. J. Cancer Supplement 7, 69-72 (1992). 

4. AshkenazL A ., Capon, and D. Ward, R. Immunoadhesiris. Int. Rev. Immunol. 10, 
217-225 (1993). , 

5. AshkenazL A ., and Peralta, E. Muscarinic Receptors. In Handbook of Receptors 
and Channels. (S. Peroutka, ed.), CRC Press, Boca Raton, Vol. I, p. 1-27, (1994). 

6. Krantz, S. B., Means, R. T., Jr., Lina, J., Marsters, S. A., and AshkenazL A . 
Inhibition of erythroid colony formation in vitro by gamma interferon. In 
Molecular Biology of Hematopoiesis (N. Abraham, R. Shadduck, A. Levine F. 
Takaku, eds.) Intercept Ltd. Paris, Vol. 3, p. 135-147 (1994). 

7. AshkenazL A . Cytokine neutralization as a potential therapeutic approach for 
SIRS and shock. J. Biotechnology in Healthcare 1, 197-206 (1994). 

8. AshkenazL A :, arid Chamow, S. M. Immunoadhesins: an alternative to human 
monoclonal antibodies. Immunomethods: A companion to Methods in 
Enzimology 8, 104-115 (1995). 

9. Chamow, S., and AshkenazL A . Immunoadhesins: Principles and Applications. 
Trends Biotech 14, 52-60 (1996). 

10. AshkenazL A ., and Chamow. S. M. Immunoadhesins as research tools and 
therapeutic agents. Curr. Opin. Immunol. 9, 195-200 (1997). 

1 1 . AshkenazL A ., and Dixit, V. Death receptors: signaling and modulation. Science 
281, 1305-1308 (1998). 

12. AshkenazL A ., and Dixit, V. Apoptosis control by death and decoy receptors. 
Curr. Opin. Cell Biol 11, 255-260 (1999). 



9 




13. Ashkenazi. A . Chapters on Apo2L/TRAIL; DR4, DR5, DcRl, DcR2; and DcR3. 
Online Cytokine Handbook (www.apnet.com/cvtokinereferenceA . 

14. Ashkenazi. A . Targeting death and decoy receptors of the tumor necrosis factor 
superfamily. Nature Rev. Cancer 2, 420-430 (2002). 

15. LeBlanc, H. and Ashkenazi, A . Apoptosis signaling by Apo2L/TRAIL. Cell Death 
and Differentiation 10, 66-75 (2003). 

16. Almasan, A. ahd Ashkenazi. A . Apo2L/TRAIL: apoptosis signaling, biology, and 
potential for cancer therapy. Cytokine and Growth Factor Reviews 14, 337-348 
(2003). 

Book: 

Antibody Fusion Proteins (Chamow, S., and Ashkenazi. A ., eds., John Wiley and 
Sons Inc.) (1999). 

Talks: 

1 . Resistance of primary HIV isolates to CD4 is independent of CD4-gp 1 20 binding 
affinity. UCSD Symposium, HIV Disease: Pathogenesis and Therapy. 
Greenelefe, FL, March 1991. 

2. Use of immuno-hybrids to extend the half-life of receptors. IBC conference on 
Biopharmaceutical Halflife Extension. New Orleans, LA, June 1992. 

3. Results with TNF recejptor immunoadhesins for the Treatment of Sepsis. IBC 
conference on Endotoxemia and Sepsis. Philadelphia, PA, June 1992. 

4. Immuhdadh^sins: an alternative to human antibodies. IBC conference on 
Antibody Engineering. San Diego, CA, December 1993. 

5. Tumor necrosis factor receptor: a potential therapeutic for human septic shock. 
American Society for Microbiology Meeting, Atlanta, GA, May 1 993. 

6. Protective efficiacy of TNF receptor immunoadhesin vs anti-TNF monoclonal 
antibody in a rat model for ehdotoxic shock. 5th International Congress on TNF. 
Asilomar, CA, May 1994. 
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Stanford University School of Medicine, Palo Alto, CA, December 1 996. 

13. Apo-3: anovel receptor that regulates cell death and inflammation. 4th 
International Congress on Immune Consequences of Trauma, Shock, and Sepsis. 
Munich, Germany, March 1997. 

14. New members of the TNF ligand and receptor families that regulate apoptosis, 
inflammation, and immunity. UCLA School of Medicine, LA, CA, March 1997. 

15. Immundadhesins: an alternative to monoclonal antibodies. 5th World Conference 
on Bispecific Antibodies. Volendam, Holland, June 1997. 

16. Control of Apo2L signaling. Cold Spring Harbor Laboratory Symposium on 
Programmed Cell Death. Cold Spring Harbor, New York. September, 1997. 

17. Chairman and speaker, Apoptosis Signaling session. DBC's 4th Annual 
Conference oh Apoptosis. San Diego, CA., October 1997. 

18. Control of Apo2L signaling by death and decoy receptors. American Association 
for the Advancement of Science. Philadelphia, PA, February 1 998. 

19. Apo2 ligaiid and its receptors. American Society of Immunologists. San 
Frahcisco, CA, April 1998. 

20. Death receptors and ligands. 7th International TNF Congress. Cape Cod, MA, 
May 1998, 

21. Apo2L as a potential therapeutic for cancer. UCLA School of Medicine. LA, 
CA, June 1998. 

22. Apb2L as k potential therapeutic for cancer. Gordon Research Conference on 
Cancer Chemotherapy. New London, NH, July 1998. 

23. Control of apoptosis by Apo2L. Endocrine Society Conference, Stevenson, WA, 
August 1998. 

24. Control of apoptosis by Apo2L. International Cytokine Society Conference, 
Jerusalem, Israel, October 1998. 



11 



25. Apoptosis control by death and decoy receptors. American Association for 
Cancer Research Conference, Whistler, BC, Canada, March 1999. 

26. Apoptosis control by death and decoy receptors. American Society for 
Biochemistry and Molecular Biology Conference, San Francisco, C A, May 1 999. 

27. Apoptosis control by death and decoy receptors. Gordon Research Conference on 
Apoptosis, New London, NH, June 1999. 

28. Apoptosis control by death and decoy receptors. Arthritis Foundation Research 
Conference, Alexandria GA, Aug 1999. 

29. Safety and anti-tumor activity of recombinant soluble Apo2I7TRAIL. Cold 
Spring Harbor Laboratory Symposium on Programmed Cell Death. . Cold Spring 
Harbor, NY, September 1999. 

30. The Apo2L/TRAIL system: therapeutic potential. American Association for 
Cancer Research, Lake Tahoe, NV, Feb 2000. 

31. Apoptosis and cancer therapy. Stanford University School of Medicine, Stanford, 
CA y Mar 2000. 

. 32. Apoptosis and cancer therapy. University of Pennsylvania School of Medicine, 
Philadelphia, PA, Apr 2000. 

33. Apoptosis signaling by Apo2L/TRAIL. International Congress on TNF. 
Trondheim, Norway, May 2000. 

34. The Apo2L/TRAiL system: therapeutic potential. Cap-CURE summit meeting. 
Santa Monica, CA, June 2000. 

35. The Apo2L/TRAIL system: therapeutic potential. MD Anderson Cancer Center. 
Houston, TX, June 2000. 

36. Apoptosis signaling by Apo2L/TRAIL. The Protein Society, 14 th Symposium. 
San E>iego, CA, August 2000. 

37. Anti-tumor activity of Apo2L/TRAIL. AAPS annual meeting. Indianapolis, IN 
Aug 2000. 

38. Apoptosis signaling and anti-cancer potential of Apo2L/TRAIL. Cancer Research 
Institute, UC San Francisco, CA, September 2000. 

39. Apoptosis signaling by Apo2I/TRAIL. Kenote address, TNF family 
Minisymposium, NIH. Bethesda, MD, September 2000. 

40. Death receptors: signaling and modulation. Keystone symposium on the 
Molecular basis of cancer. Taos, NM, Jan 2001. 

41. Preclinical studies of Apo2L/TRAIL in cancer. Symposium on Targeted therapies 
in the treatment of lung cancer. Aspen, CO, Jan 2001. 



12 




42. Apoptosis signaling by Apo2L/TRAIL. Wiezmann Institute of Science, Rehovot, 
Israel, March 2001. 

43. Apo2L/TRAIL: Apoptosis signaling and potential for cancer therapy. Weizmann 
Institute of Science, Reho vot, Israel, March 200 1 . 

44. Targeting death receptors in cancer with Apo2L/TRAIL. Cell Death and Disease 
conference, North Falmouth, MA, Jun 2001 . 

45. Targeting death receptors in cancer with Apo2L/TRAIL. Biotechnology 
Organization conference, San Diego, CA, Jun 200 1 . 

46. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Gordon Research 
Conference on Apoptosis, Oxford, UK, July 2001. 

47. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Cleveland Clinic 
Foundation, Cleveland, OH, Oct 2001. 

48. Apoptosis signaling by death receptors: overview. International Society for 
Interferon and Cytokine Research conference, Cleveland, OH, Oct 2001. 

49. Apoptosis signaling by death receptors. American Society of Nephrology 
Conference. Sah Francisco, C A, Oct 2001. 

50. Targeting death receptors in cancer. Apoptosis: commercial opportunities. San 
Diego, CA, Apr 2002. 

5 1 . Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Kimmel Cancer 
Research Center, Johns Hopkins University, Baltimore MD. May 2002. 

52. Apoptosis control by Apo2I7TRAIL. (Keynote Address) University of Alabama 
Cancer Center Retreat, Birmingham, Ab. October 2002. 

53. Apoptosis signaling by Apo2L/TRAIL. (Session co-chair) TNF international 
conference. San Diego, CA. October 2002. 

54. Apoptosis signaling by Apo2L/TRAIL. Swiss Institute for Cancer Research 
(ISREC). Lausanne, Swizerland. Jan 2003. 

55. Apoptosis induction with Apo2L/TRAIL. Conference on New Targets and 
Innovative Strategies in Cancer Treatment. Monte Carlo. February 2003. 

56. Apoptosis signaling by Apo2L/TRAIL. Hermelin Brain Tumor Center 
Syrtiposium on Apoptosis. Detroit, MI. April 2003. 

57. Targeting apoptosis through death receptors. Sixth Annual Conference on 
Targeted Therapies in the Treatment of Breast Cancer. Kona, Hawaii. July 2003. 

58. Targeting apoptosis through death receptors. Second International Conference on 
Targeted Cancer Therapy. Washington, DC. Aug 2003. 

Issued Patents: 



13 



1. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 5,329,028 (Jul 12, 1994). 

2. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 5,605,791 (Feb 25, 1997). 

3. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 5,889,155 (Jul 27, 1999). 

4. Ashkenazi, A., APO-2 Ligand. US patent 6,030,945 (Feb 29, 2000). 

5. Ashkenazi, A., Chuntharapai, A., Kim, J., APO-2 ligand antibodies. US patent 6, 
046, 048 (Apr 4, 2000). 

6. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 6,124,435 (Sep 26, 2000). 

7. Ashkenazi, A., Chuntharapai, A., Kim, J., Method for making monoclonal and cross- 
reactive antibodies. US patent 6,252,050 (Jun 26, 2001). 

8. Ashkenazi, A. APO-2 Receptor. US patent 6,342,369 (Jan 29, 2002). 

9. Ashkenazi, A; Fdng, S., Goddard, A., Gurney, A., Napier, M., Tumas, D., Wood, W. 
A-33 polypeptides. US patent 6,410,708 (Jun 25, 2002). 

10. Ashkenazi, A. APO-3 Receptor. US patent 6,462,176 Bl (Oct 8, 2002). 

11. Ashkenazi, A. APO-2LI and APO-3 polypeptide antibodies, US patent 6,469,144 Bl 
(Oct 22, 2002). 

12. Ashkenazi, A., ChamoWj S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 6,582,928B1 (Jun 24, 2003). 



14 



o 



© 



DECLARATION OF PAUL POLAKIS, Ph.D. 
■I, Paul Polakis, Ph.O., declare and say as follows: 

1. I was awarded a Ph.D. by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. Since joining Genentech in 1999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins cc tumor antigen proteins". When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date, we 
have generated antibodies that bind to about 30 of the tumor antigen proteins 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 





expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative t 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon. 
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Non-invasive and Invasive Human Transitional 
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Gain and loss of chromosomal material is characteristic 
of bladder cancer, as well as malignant transformation in 
general. The consequences of these changes at both the 
transcription and translation levels is at present unknown 
partly because of technical limitations. Here we have at- 
tempted to address this question in pairs of non-invasive 
and invasive human bladder tumors using a combination 
of technology that included comparative genomic hybrid- 
ization, high density oligonucleotide array-based monitor- 
ing of transcript levels (5600 genes), and high resolution 
two-dimensional gel electrophoresis/the results showed - 
that there is a gene dosage effect that in some cases 
superimposes on other regulatory mechanisms. This ef- 
fect depended (p < 0.015) on the magnitude of the com- 
parative genomic hybridization change. In general (16 of 
23 cases), chromosomal areas with more than 2-fold gain 
of DNA showed a corresponding increase in mRNA tran- 
scripts. Areas with loss of DNA, on the other hand, 
showed either reduced or unaltered transcript levels, Be- 
cause most proteins resolved by two-dimensional gels 
are unknown it was only possible to compare mRNA and 
protein alterations in. relatively few cases of well focused 
abundant proteins. With few exceptions we found a good 
correlation (p < 0.005) between transcript alterations and 
protein levels. The implications, as well as limitations, 
of the approach are discussed. Molecular & Cellular 
Proteomics 1:37-45, 2002. 



Aneuploidy is a common feature of most human cancers 
(1), but little is known about the genome-wide effect of this 
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phenomenon at both the transcription and translation levels. 
High throughput array studies of the breast cancer cell line 
BT474 has suggested that there is a correlation between 
DNA copy numbers and gene expression in highly amplified 
areas (2), and studies of individual genes in solid tumors 
have revealed a good correlation between gene dose and 
mRNA or protein levels in the case of c-erb-B2 t cyclin dh 
emsh and N-myc (3-5). However, a high cyclin Di protein 
expression has been observed without simultaneous am- 
plification (4), and a low level of c-myc copy number in- 
crease was observed without concomitant c-myc protein 
overexpression (6). 

In human bladder tumors, karyotyping, fluorescent in situ 
hybridization, and comparative genomic hybridization (CGH) 1 
have revealed chromosomal aberrations that seem to be 
characteristic of certain stages of disease progression. In the 
case of non-invasive pTa transitional cell carcinomas (TCCs), 
this includes loss of chromosome 9 or parts of it, as well as 
loss of Y in males. In minimally invasive pT1 TCCs, the fol- 
lowing alterations have been reported: 2q-, 11 p-. 1q + . 
11ql3 + , 17q + , and 20q+ (7-12). It has been suggested that 
these regions harbor tumor suppressor genes and onco- 
genes; however, the large chromosomal areas involved often 
contain many genes, making meaningful predictions of the 
functional consequences of losses and gains very difficult. 

In this investigation we have combined genome-wide tech- 
nology for detecting genomic gains and losses (CGH) with 
gene expression profiling techniques (microarrays and pro- 
teomics) to determine the effect of gene copy number on 
transcript and protein levels in pairs of non-invasive and in- 
vasive human bladder TCCs. 

EXPERIMENTAL PROCEDURES 
- Material— Bladder tumor biopsies were sampled after informed 
consent was obtained and after removal of tissue for routine pathol- 
ogy examination. By light microscopy tumors 335 and 532 were 
staged by an experienced pathologist as pTa (superficial papillary), 



1 The abbreviations used are: CGH, comparative genomic hybrid- 
ization; TCC, transitional cell carcinoma; LOH, loss of heterozygosity; 
PA-FABP, psoriasis-associated fatty acid-binding protein; 2D, 
two-dimensional. 
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Fig. 1. DNA copy number and mRNA expression level. Shown from left to right are chromosome [Chr), CGH profiles, gene location and 
expression level of specific genes, and overall expression level along the chromosome. A, expression of mRNA In invasive tumor 733 as 
compared with the non-invasive counterpart tumor 335. B, expression of mRNA in invasive tumor 827 compared with the non-invasive 
counterpart tumor 532. The average fluorescent signal ratio between tumor. DNA and normal DNA is shown along the length of the chromosome 
{left) The bold curve in the ratio profile represents a mean of four chromosomes and is surrounded by thin curves indicating one standard 
deviation. The central vertical line (broken) indicates a ratio value of 1 (no change), and the vertical lines next to it (dotted) indicate a ratio of 
0 5 (left) and 2.0 (right), in chromosomes where the non-invasive tumor 335 used for comparison showed alterations in DNA content, the ratio 
profile of that chromosome is shown to the right of the invasive tumor profile. The colored bars represents one gene each, identified by the 
running numbers above the bars (the name of the gene can be seen at www.MDL.DK/sdata.html). The bars indicate the purported location of 
the gene and the colors indicate the expression level of the gene in the invasive tumor compared with the non-invasive counterpart; >2-fold 
increase (black) >2-fold decrease (blue), no significant change (orange). The bar to the far right, entitled Expression shows the resulting change 
in expression along the chromosome; the colors indicate that at least half of the genes were up-regulated (black), at least half of the genes 
down-regulated (blue), or more than half of the genes are unchanged (orange). If a gene was absent in one of the samples and present m 
another, it was regarded as more than a 2-fold change. A 2-fold level was chosen as this corresponded to one standard deviation in a double 
determination of -1800 genes. Centromeres and heterochromatic regions were excluded from data analysis. 



grade I and II, respectively, tumors 733 and B27 were staged as pTl 
(invasive into submucosal 733 was staged as solid, and 827 was 
staged as papillary, both grade III. 

mRNA Preparation— Tissue biopsies, obtained fresh from surgery, 
were embedded 'immediately In a sodium-guanidinium thiocyanate 
solution and stored at -80 °C. Total RNA was isolated using the 
RNAzol B RNA isolation method (WAK-Chemie Medical GMBH). 
poly(A)" RNA was isolated by an oligo(dT) selection step (Oligotex 
mRNA kit; Qiagen). 

cRNA Preparation— 1 m9 of mRNA was used as starting material. 
The first and second strand cDNA synthesis was performed using the 
Superscript® choice system (Invitrogen) according to the manufac- 
turer's instructions but using an oligo(dT) primer containing a T7 RNA 
polymerase binding site. Labeled cRNA was prepared using the ME- 
GAscrip® in vitro transcription kit (Ambion). Biotin-labeled CTP and 



UTP (Enzo) was used, together with unlabeled NTPs in the reaction. 
Following the in vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy columns (Qiagen). 

Array Hybridization and Scanning— Array hybridization and scan- 
ning was modified from a previous method (13). 10 ^g of cRNA was 
fragmented at 94 *C for 35 min in buffer containing 40 mM Tris 
acetate. pH 8.1, 100 mM KOAc, 30 mM MgOAc. Prior to hybridization, 
the fragmented cRNA in a 6x SSPE-T hybridization buffer (1 m NaCI. 
10 mM Tris, pH 7,6, 0.005% Triton), was heated to 95 °C for 5 min, 
subsequently cooled to 40 ''C, and loaded onto the Affymetrix probe 
array cartridge. The probe array was then incubated for 16 h at 40 °C 
at constant rotation (60 rpni). The probe array, was exposed to 10 
washes In 6 x SSPE-T at 25 9 C followed by 4 washes in 0.5* SSPE-T 
at 50 °C. The biotinylated cRNA was stained with a streptavidin- 
phycoerythrin conjugate. 10 /xg/ml (Molecular Probes) in 6x SSPE-T 
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Fig. 1 — continued 



for 30 min at 25 °C followed by 10 washes in 6x SSPE-T at 25 "C. The 
probe arrays were scanned at 560 nm using a confocal laser scanning 
microscope {made for Affymetrix by Hewlett-Packard). The readings 
from the quantitative scanning were analyzed by Affymetrix gene 
expression analysis software. 

Micrvsatefiite Analysis -Microsatellite Analysis was performed as 
described previously (14). Microsatellites were selected by use of 
www.ncbi.nlm.nth.gov/genemap98, and primer sequences were ob- 
tained from the genome data base at www.gdb.org. DNA was extracted 
from tumor and blood and amplified by PCR in a volume of 20 i±\ for 35 
cycles. The amplicons were denatured and electrophoresed for 3.h in an 
ABI Prism 377. Data were collected in the Gene Scan program for 
fragment analysis. Loss of heterozygosity was defined as less than 33% 
of one allele detected in tumor amplicons compared with blood. 

Proteomic Analysis— TCCs were minced into small pieces and 
homogenized in a small glass homogenizer in 0.5 ml of lysis solution. 
Samples were stored at -20 °C until use. The procedure for 2D gel 
electrophoresis has been described in detail elsewhere (15, 16). Gels 
were stained with silver nitrate and/or Coomassie Brilliant Blue. Pro- . 
teins were identified by a combination of procedures that included 
microsequencing, mass spectrometry, two-dimensional gel Western 
immunoblotting, and comparison with the master two-dimensional gel 
image of human Keratinocyte proteins; see biobase.dk/cgi-bin/celis. 

CGH- Hybridization of differentially labeled tumor and normal DNA 
to normal metaphase chromosomes was performed as described 
previously (10). Fluorescein-labeled tumor DNA (200 ng), Texas Red- 



labeled reference DNA (200 ng), and human Cot-1 DNA (20 m9) w ® re 
denatured at 37 C C for 5 min and applied to denatured normal met- 
aphase slides. Hybridization was at 37 °C for 2 days. After washing, 
the slides were counterstained with 0.15 ^g/m\ 4,6-dtamidino-2-phe- 
nyiindoie in an anti-fade solution. A second hybridization was per- 
formed for ail tumor samples using fluorescein-labeled reference DNA 
and Texas Red-labeled tumor DNA (inverse labeling) to confirm the 
aberrations detected during the initial hybridization. Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Red-labeled normal DNA. Digital image analysis was 
used to identify chromosomal regions with abnormal fluorescence 
ratios, indicating regions of DNA gains and losses. The average 
green: red fluorescence intensity ratio profiles were calculated using 
four images of each chromosome (eight chromosomes total) with 
normalization of the green:red fluorescence intensity ratio for the 
entire metaphase and background correction. Chromosome identifi- 
cation was performed based on 4,6-diamidino-2-phenylindole band- 
ing patterns. Only images showing uniform high intensity fluores- 
cence with minimal background staining were analyzed. All 
centromeres, p arms of acrocentric chromosomes, and heterochro- 
matic regions were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridization -The CGH analysis 
identified a number of chromosomal gains and losses in the 
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Tabih I 

Correlation between alterations detected by CGH and by expression monitoring 
Top, CGH used as independent variable (if CGH alteration - what expression ratio was found); bottom, altered expression used as 
independent variable (if expression alteration - what CGH deviation was found). 
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two invasive tumors (stage pT1 t TCCs 733 and 827),. whereas 
the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p-, 9q22-q33-, and X-, and 7 + . 9q~. 
and respectively. Both invasive tumors showed changes 
(1q22-24+, 2q14.1-qter-, 3q12-q13.3-, 6q12-q22-, 
9q34+, 11q12-q13+, 17 + . and 20q11 .2-q12+) that are typ- 
ical for their disease stage, as well as additional alterations, 
some of which are shown in Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4-fold 
in the case of TCC 733 and 0.3-fold for TCC 827. The largest 
changes, amounting to at least a doubling of chromosomal 
content, were observed at 1q23 in TCC 733 (Fig. ^A) and 
20q12 in TCC 827 (Fig. 18). 

mRNA Expression in Relation to DNA Copy Number— The 
mRNA levels from the two invasive tumors (TCCs 827 and 
733) were compared with the two non-invasive counterparts 
(TCCs 532 and 335). This was done in two separate experi- 
ments in which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1,800 genes that yielded a signal on the arrays 
were searched in the Unigene and Genemap data bases for 
chromosomal location, and those with a known location 
(1096) were plotted as bars covering their purported locus. In 
that way it was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level in 
the invasive versus the non-invasive counterpart. Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fold were regarded as informative (Fig. 1). The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the CGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
the chromosomal aberrations are not known at high resolution. 

Two sets of calculations were made from the data. For the 
first set we used CGH alterations as the independent variable 
and estimated the frequency of expression alterations in these 
chromosomal areas. In general, areas with a strong gain of 
chromosomal material contained a cluster of genes having 
increased mRNA expression. For example, both chromo- 
somes 1q21-q25, 2p and 9q, showed a relative gain of more 
than 100% in DNA copy number that was accompanied by 
increased mRNA expression levels in the two tumor pairs (Fig. 
1). In most cases, chromosomal gains detected by CGH were 
accompanied by an increased level of transcripts in both 
TCCs 733 (77%) and 827 (80%) {Table I, top). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression in several cases, and were often regis- 
tered as having unaltered RNA levels (Table t, top). The inabil- 
ity to detect RNA expression changes in these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

In the second set of calculations we selected expression 
alterations above 2-fold as the independent variable and es- 
timated the frequency of CGH alterations in these areas. As 
above, we found that increased transcript expression corre- 
lated with gain of chromosomal material (TCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected in areas with unaltered CGH ratios (Table I, bottom). 
Furthermore, as a control we looked at areas with no alter- 
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Fig. 2. Correlation between maximum CGH aberration and the ability to detect expression change by oligonucleotide array 
monitoring. The aberration is shown as a numerical -fold change in ratio between invasive tumors 827 {A) and 733 (♦) and their non-invasive 
counterparts 532 and 335. The expression change was taken from the Expression line to the right in Fig, 1, which depicts the resulting 
expression change for a given chromosomal region. At least half of the mRNAs from a given region have to be either up- or down-regulated 
to be scored as an expression change. All chromosomal arms in which the CGH ratio plus or minus one standard deviation was outside the 
ratio value of one were included. 



ation in expression. No alteration was detected by CGH in 
most of these areas (TCC 733, 60% and TCC 827, 81 %; see 
Table I, bottom). Because the ability to observe reduced or 
increased mRNA expression clustering to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations in the 
regions showing CGH changes against the ability to detect a 
change in mRNA expression as monitored by the oligonucleo- 
tide arrays (Fig. 2)LF.or both tumors TCC 733 (p < 0.015) and 
TCC 827 (p < 0.00003) a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
the DNA copy number) and alterations detected by the array 
based technology (Fig. 2), Similar data were obtained when 
areas with altered expression were used as independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated 1 .6- to 2.0-fold (Table I, bottom) but mostly did 
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction in expres- 
sion level, which is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
much larger extent. 

Microsatellite-based Detection of Minor Areas of Loss- 
es — In TCC 733, several chromosomal areas exhibiting DNA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1 , TCC 733 
chromosome 1q32, 2p21, and 7q21 and q32, 9q34, and 
10q22). To determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two microsatellites positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Fig. 3). Additionally, chromosome 2p in 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript, increase/decrease/in- 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Fig. ^A). As indicated above, the mRNA decrease observed in 
the middle of the chromosomal gain was because of LOH, 
implying that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material. However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827, the telomeric end of chro- 
mosome lip showed a normal ratio in the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two microsatellites (D1 1S1760, D1 1S922) 
positioned close to MUC2, IGF2; and cathepsin D indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown}. 

A reduced expression of mRNA observed in TCC 733 at 
chromosomes 3q24, 11p11. 12p12.2, 12q21.1, and 16q24 
and in TCC 827 at chromosome 11p15.5, 12p11, 15q11.2, 
and 18q12 was also examined for chromosomal losses using 
microsatellites positioned as close as possible to the gene loci 
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Fig. 3. Microsateiiite analysis of loss of heterozygosity. Tumor 
733 showing loss of heterozygosity at chromosome 1q25, detected 
(a) by D1S215 close to Hu class I histocompatibility antigen (gene 
number 38 in Fig. 1), [b) by D1S2735 close to cathepsin E (gene 
number 41 in Fig. 1), and (c) at chromosome 2p23 by D2S2251 close 
to general /3-spectrin (gene number 1 1 on Fig. 1) and of (d) tumor 827 
showing toss of heterozygosity at chromosome 18q12 by S18S1 1 18 
close to mitochondrial 3-oxoacyi-coenzyme A thiolase (gene number 
12 in Fig. 1). The upper curves show the electropherogram obtained 
from normal DNA from leukocytes (A/), and the lower curves show the 
electropherogram from tumor DNA (7). In all cases one allele is 
partially lost in the tumor ampiicon. 

showing reduced mRNA transcripts. Only the microsateiiite 
positioned at 18q12 showed LOH (Fig. 3), suggesting that 
transcriptional, down-regulation of genes in the other regions 
may be controlled by other mechanisms. 

Relation between Changes in mRNA and Protein Levels— 
2D-PAGE analysis, in combination with Coomassie Brilliant 
Blue and/or silver staining, was carried out on all four tumors 
using fresh biopsy material. 40 well resolved abundant known 
proteins migrating in areas away from the edges of the pH 
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Fig. 4. Correlation between protein levels as judged by 20- 
PAGE and transcript ratio. F t or comparison proteins were divided in 
three groups, unaltered in level or up- or down-reguiated {horizontal 
axis). The mRNA ratio as determined by oligonucleotide arrays was 
plotted for each gene {vertical ax/sj. A, mRNAs that were scored as 
present in both tumors used for the ratio calculation; A, mRNAs that 
were scored as absent in the invasive tumors (along horizontal axis) or 
as absent in non -invasive reference (fop of figure). Two different 
scalings were used to exclude scaling as a confounder, TCCs 827 
and 532 (AA) were scaled with background suppression, and TCCs 
733 and 335 (0O) were scaled without suppression. Both compari- 
sons showed highly significant tp < 0.005) differences in mRNA ratios 
between the groups. Proteins shown were as follows: Group A (from 
left), phosphoglucomutase 1 , glutathione transferase class m number 
4, fatty acid-binding protein homologue, cytokeratin 15, and cyto- 
keratin 13; S (from left), fatty acid-binding protein homologue, 28-kDa 
heat shock protein, cytokeratin 13, and calcyclin; C-(from left), <>-eno- 
lase, hnRNP B1, 28-kDa heat shock protein, 14-3-3-fe, and 
pre-mRNA splicing factor; D, mesothelia! keratin K7 (type II); E (from 
fop), glutathione S-1ransf erase- tt and mesothelia! keratin K7 (type II); 
F (from top and left), adenylyl cyclase-associated protein, E-cadherin, 
keratin 19, calgizzarin, phosphoglycerate mutase, annexin IV, cy- 
toskeletal y-actin, hnRNP A1, integral membrane protein calnexin 
(IP90), hnRNP H, brain-type clathrin light chain-a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear ribonucleoprotein A/B, 
translationally controlled tumor protein, liver glyceraldehyde-3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na.K- 
ATPase /3-1 subunit; G, (from top and left), TCP20, calgizzarin, 70- 
kDa heat shock protein, calnexin, hnRNP H, cytokeratin 15, ATP 
synthase, keratin 19, triosephosphate isornerase, hnRNP F, liver give - 
eraldehyde-3-phosphatase dehydrogenase, glutathione S-transfer- 
ase-Tr, and keratin 8; H (from left), plasma gelsolin, autoantigen cal- 
reticulin. thioredoxin, and NAD+ -dependent 15 hydroxy prostaglandin 
dehydrogenase; / (from fop), prolyl 4-hydroxylase 0-subunit, cyto- 
keratin 20, cytokeratin 17, prohibition, and fructose 1,6-biphos- 
phatase; J annexin II; K, annexin IV; L (from top and left), 90-kDa heat 
shock protein, prolyl 4-hydroxylase 0-subunit f u-enolase, GRP 78, 
cyclophilin, and cofilin. 

gradient, and having a known chromosomal location, were 
selected for analysis in the TCC pair 827/532. Proteins were 
identified by a combination of methods (see "Experimental 
Procedures"). In genera! there was a highly significant corre- 
lation (p < 0.005) between mRNA and protein alterations (Fig. 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except for a group of cyto- 
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Fig. 5. Comparison of protein and transcript levels in invasive 
and non-invasive TCCs, The upper part of the f igure shows a 2D gel 
(/eft) and the oligonucleotide array {right) of TCC 532. The red rectan- 
gles on the upper gel highlight the areas that are compared below. 
Identical areas of 2D gels of TCCs 532 and 827 are shown below. 
Clearly, cytokeratins 13 and 15 are strongly down-regulated in TCC 
827 (red annotation). The tile on the array containing probes for 
cytokeratin 1 5 is enlarged below the array (red arrow) from TCC 532 
and is compared with TCC 827. The upper row of squares in each tile 
corresponds to perfect match probes; the lower row corresponds to 
mismatch probes containing a mutation (used for correction for un- 
specific binding). Absence of signal is depicted as black, and the 
higher the signaJ the lighter the color. A high transcript level was 
detected in TCC 532 (6151 units) whereas a much lower level was 
detected in TCC 827 (absence of signals). For cytokeratin 13, a high 
transcript level was also present in TCC 532 (15659 units), and a 
much lower level was present in TCC 827 (623 units). The 2D gels at 
the bottom of the figure {left) show levels of PA-FABP and adipocyte- 
FABP in TCCs 335 and 733 (invasive), respectively. Both proteins are 
down-regulated in the-invasive tumor. To the right we show the array 
tiles for the PA-FABP transcript, A medium transcript level was de- 
tected in the case of TCC 335 (1277 units) whereas very low levels 
were detected in TCC 733 (166 units). IEF t isoelectric focusing. 



keratins encoded by genes on chromosome 17 (Fig. 5) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal 
location were detected in TCCs 733 and 335, and of these 19 
correlated ip < 0.005) with the mRNA changes detected using 
the arrays (Fig. 4). For example, PA-FABP was highly ex* 
pressed in the non-invasive TCC 335 but lost in the invasive 
counterpart (TCC 733; see Fig. 5). The smaller number of 
proteins detected in both 733 and 335 was because of the 
smaller size of the biopsies that were available. 

1 1 chromosomal regions where CGH showed aberrations 
that corresponded to the changes in transcript levels also 
showed corresponding changes in the protein level (Table II). 
These regions included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratins 17 and 20, annexins II and IV, and the fatty 
acid-binding proteins PA-FABP and FBP1. Four of these pro- 
teins were encoded by genes in chromosome 17q, a fre- 
quently amplified chromosomal area in invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content, , having 
tost some chromosomal parts and gained others. The present 
study provides some evidence as to the effect of these gains 
and losses on gene expression in two pairs of non-invasive 
and invasive TCCs using high throughput expression arrays 
and proteomics, in combination with CGH. In general, the 
results showed that there is a clear individual regulation of the 
mRNA expression of single genes, which in some cases was 
superimposed by a DNA copy number effect. In most cases, 
genes located in chromosomal areas with gains often exhib- 
ited increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fold change, 
thus being at the border of detection. In several cases, how- 



Protein 



Table II 

Proteins whose expression level correlates with both mRNA and gene dose changes 



Chromosomal location 



Annexin II 
Annexin IV 
Cytokeratin 17 
Cytokeratin 20 
(PA-)FABP 
FBP1 

Plasma gelsolin 
Heat shock protein 28 
Prohibitin 
Prolyl-4-hydroxyl 
hnRNPBI 



1q2l 
2p13 

17q12-q21 

17q21.1 

Bq21.2 

9q22 

9q31 

15q12-q13 
17q2l 
17q25 
7p15 



Tumor TCC 


CGH alteration 


Transcript alteration" 


Protein alteration 


733 


Gain 


Abs to Pres" 


Increase 


733 


Gain 


3.9-Fold up 


Increase 


827 


Gain 


3.8-Fold up 


Increase 


827 


Gain 


5.6-Fold up 


Increase ■ 


827 


Loss 


10-Fold down 


Decrease 


827 


Gain 


2.3-Fold up 


Increase 


827 


Gain 


Abs to Pres 


Increase 


827 


Loss 


2.5-Fold up 


Decrease 


827/733 


Gain 


3.7-/2.5-Fold up'' 


Increase 


827/733 


Gain 


5.7-/1 .6-Fold up 


Increase 


827 


Loss 


2.5-Fold down 


Decrease 



" Abs, absent; Pres. present. 

' J In cases where the corresponding alterations were found in both TCCs 827 and 733 these are shown as 827/733. 
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ever, an increase or decrease in DNA copy number was 
associated with de novo occurrence or complete loss of tran- 
script, respectively. Some of these transcripts could not be 
detected in the non-invasive tumor but were present at rela- 
tively high levels in areas with DNA amplifications in the inva- 
sive tumors (e.g. in TCC 733 transcript from cellular ligand of 
annexin II gene (chromosome 1q21) from absent to 2670 
arbitrary units; in TCC 827 transcript from small proiine-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an increased expres- 
sion to a certain chromosomal area indicates an increased 
likelihood of gain of chromosomal material in this area. 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may lie in the loss of controlled 
methylation in tumor cells (17-19). Thus, it may be possible 
that in chromosomes with increased DNA copy numbers two 
or more alleles could be demethyiated simultaneously leading 
to a higher transcription level, whereas in chromosomes with 
losses the remaining allele could be partly methylated, turning 
off the process (20, 21). A recent report has documented a 
ploidy regulation of gene expression in yeast, but in this case all 
the genes were present in the same ratio (22), a situation that is 
not analogous to that of cancer cells, which show marked 
chromosomal aberrations, as well as gene dosage effects. 

Several CGH studies of bladder cancer have shown that 
some chromosomal . aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3 tumors. In pTa tumors, these include 9p~, 9q~, 1q + , Y- 
(2, 6), and in pT1 tumors, 2q~,11p-, 11q-, 1q+, 5p+, 8q + , 
17q+ f and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p~ and 9q22-q33- and 
9q- and Y-, respectively. Likewise, the two minimal invasive 
pT1 tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1q22-24 
amplification (seen in both tumors), 1 1q14-q22 loss, the latter 
often linked to 17 q+ (both tumors), and 1q+ and 9p~, often 
linked to 20q+ and 11 q13+ (both tumors) (7-9). These ob- 
servations indicate that the pairs of tumors used in this study 
exhibit chromosomal changes observed in many tumors, and 
therefore the findings could be of general importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases it is only possible to get a crude picture of 
chromosomal instability using this technique. Occasionally, 
we observed reduced transcript levels close to or inside re- 
gions with increased copy numbers. Analysis of these regions 
by positioning heterozygous microsatellites as close as pos- 
sible to the locus showing reduced gene expression revealed 
loss of heterozygosity in several cases. It seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use of cDNA microarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations. At present we do not know the mechanism be- 
hind chromosomal aneuploidy and cannot predict whether 
chromosomal gains will be transcribed to a larger extent than 
the two native alleles. A mechanism as genetic imprinting has 
an impact on the expression level In norma) cells and is often 
reduced in tumors. However, the relation between imprinting 
and gain of chromosomal material is not known. 

We regard it as a strength of this investigation that we were 
able to compare invasive tumors to benign tumors rather than 
to normal urothelium, as the tumors studied were biologically 
very close and probably may represent successive steps in 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available it was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and mRNA expression is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This indicate that 
bladder tumors are relatively homogenous, a notion recently 
supported by CGH and LOH data that showed a remarkable 
similarity even between tumors and distant metastasis (10, 23). 

In the few cases analyzed, mRNA and protein levels 
showed a striking correspondence although in some cases 
we found discrepancies that may be attributed to transiationa! 
regulation, post-translationai processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertranslated mRNA pools, which are associated with few 
translationally inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very important in the case of polypeptides with a short 
half-life (e.g. signaling proteins). A poor correlation between 
mRNA and protein levels was found in liver cells as deter- 
mined by arrays and 2D-PAGE (25), and a moderate correla- 
tion was recently reported by Ideker ef a/. (26) in yeast. 
( interestingly, our study revealed a much better correlation 
between gained chromosomal areas and increased mRNA 
levels than between loss of chromosomal areas and reduced 
mRNA levels. In general, the level of CGH change determined 
the ability to detect a change in transcript': One possible 
explanation could be that by losing one allele the change in 
mRNA level is not so dramatic as compared with gain of 
material, which can be rather unlimited and may lead to a 
severalfold increase in gene copy number resulting in a much 
higher impact on transcript level. The latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2-fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may in the future allow detection of lesser than 2-fold 
alterations in transcript levels, a feature that may facilitate the 
analysis of the effect of loss of chromosomal areas on tran- 
script levels. 
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In eleven cases we found a significant correlation between 
DNA copy number, mRNA expression, and protein level. Four 
of these proteins were encoded by genes located at a fre- 
quently amplified area in chromosome 17q. Whether DNA 
copy number is one of the mechanisms behind alteration of 
these eleven proteins is at present unknown and will have to 
be proved by other methods using a larger number of sam- 
ples. One factor making such studies complicated is the large 
extent of protein modification that occurs after translation, 
requiring immunoidentification and/or mass spectrometry to 
correctly identify the proteins in the gels. 

In conclusion, the results presented in this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on microarrays with many thousand radiation 
hybrid-mapped genes will increase the resolution and informa- 
tion derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived from genes with 
known locations, and 2D gel analysis to obtain information at 
the post-translational level, a clearer and more developed un- 
derstanding of the tumor genome will be forthcoming. 
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ABSTRACT 

Genetic changes underlie tumor progression and may lead to cancer- 
specific expression of critical genes. Over 1100 publications have de- 
scribed (he use of comparative genomic hybridization (CCH) to analyze 
the pattern of copy number alterations in cancer, but very few of the aenes 
affected are known. Here, we performed high-resolution COM analysis on 
cDNA microarrays in breast cancer and directly compared copy number 
and mRNA expression levels of 13,824 genes to qtiantitate the impact of 
genomic changes on gene expression. We identified and mapped the 
boundaries of 24 independent amplicons, ranging in size from 0.2 to 12 
Mb. Throughout the genome, both high- and low-level copy number 
changes had a substantial impact on gene expression, with 44% of the 
highly amplified genes showing nvercx press ion and 10.5% of the highly 
overexposed genes being amplified. Statistical analysis with random 
permutation tests identified 270 genes whose expression levels across 14 
samples were systematically attributable to gene amplification. These 
included most previously described amplified genes in breast cancer and 
many novel targets for genomic alterations, including the HOXR7 gene, 
the presence of which in a novel amplicon at 1 7 q 2 1.3 was validated in 
10.2% of primary breast cancers and associated with poor patient prog- 
nosis. In conclusion, CGH on cDNA microarrays revealed hundreds of 
novel genes whose ovcrcxpression is attributable to gene amplification. 
These genes may provide insights to the clonal evolution and progression 
of breast cancer and highlight promising therapeutic targets. 

INTRODUCTION 

Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biologically distinct catego- 
ries, some of which may explain the clinical behavior of the tumors 
(1-6). Despite this progress in diagnostic classification, the molecular 
mechanisms underlying gene expression patterns in cancer have re- 
mained elusive, and the utility of gene expression profiling in the 
identification of specific therapeutic targets remains limited. 

Accumulation of genetic defects is thought to underlie the clonal 
evolution of cancer. Identification of the genes mat mediate the effects 
of -genetic changes may be important by highlighting transcripts that 
are actively involved in tumor progression. Such transcripts and their 
encoded proteins would be ideal targets for anticancer therapies, as 
demonstrated bv the clinical success of new therapies against ampli- 
fied oncogenes,' such as ERBB2 and EGFR (7. 8), in breast cancer and 
other solid tumors. Besides amplifications of known oncogenes, over 
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Fig 1 Impact nf ncne copy number on global gene expression levels. A, percentage of 
over- and undercxprcsscd genes (Y a*h) according to copy number ratios (A' 
Threshold values used for over - and und ere agression were >2.184 (global upper 7% ot 
the cONA ratios) and <0.4K2b (global lower 7% of the expression ratios). B. percentage 
of amplified and deleted, genes according to expression ratios. Threshold values for 
amplification and deletion were >L5 and *'0.7. 



20 recurrent regions of DNA amplification have been mapped in 
breast cancer by CGH 5 (9, 10). However, these amplicons are often 
large and poorly defined, and their impact on gene expression remains 
unknown. 

We hypothesized that genome-wide identification of those gene 
expression changes that are attributable to underlying gene copy 
number alterations would highlight transcripts that are actively in- 
volved in the causation or maintenance, of the malignant phenotype. 
To identify such transcripts, we applied a combination of cDNA and 
CGH microarrays to: (<a) determine the global impact that gene copy 
number variation plays in breast cancer development and progression; 
and (6) identify and "characterize those genes whose mRNA expres- 



• The abbreviations used are: CGH. comparative genomic hybridization; FISH, fluo- 
rescence in situ hybridisation; RT-PCR. reverse nunscripliort-PCR. 
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sion is most significantly associated with amplification of the corre- 
sponding genomic template, 

MATERIALS AND METHODS 

Breast Cancer Cell Lines. Fourteen breast cancer cell lines (BT-20. BT- 
474, HCC1428. Hs578t. MCF7, MDA-361. MDA-436, MDA-453. MDA-46S, 
SKBR-3, T-47D, UACC812, ZR-75-1, and ZR-75-30) were obtained from ihe 
American Type Culrurc Colleciion (Manassas, VA). Cells were grown under 
recommended culture conditions. Genomic DNA and mRNA were isolated 
using standard protocols. 

Copy Number and Expression Analyses by cONA Microarrays. The 
preparation and printing of the 13.824 cDNA clones on glass slides were 
performed as described*! 1 1-13). Of these clones, 244 represented uncharac- 
terized expressed sequence tags, and the remainder corresponded to known 
genes. CGH experiments on cDNA microarrays were done as described (14. 
15). Briefly, 20 of genomic DNA from breast cancer cell lines and normal 
human WBCs were digested for 14-1 S h with Alid and Rsal (Life Technol- 
ogies, Inc., Rockville, MD) and purified by phenol/chloroform extraction. Six 
fig of digested cell line DNAs were labeled with Cy3-dUTP (Amersham 
Pharmacia) and normal DNA with Cy5-dUTP (Amersham Pharmacia) using 
the Bioprime Labeling kit (Life Technologies, Inc.). Hybridization ( 14, 15) and 
posthybridizaiion washes (13) were done as described. For the expression 
analyses, a standard reference (Universal Human Reference RNA; Siraiagcne, 
La Jolla, CA) was used in alt experiments. Forty /*g of reference RNA were , 
labeled with Cy3-dUTP and 3.5 /xg of lest mRNA with Cy5-dUTI\ and the 
labeled cDNAs were hybridized on microarrays as described (13,1 5). For both 
microarray analyses, a laser con focal scanner (Agilent Technologies, Palo 
Alto, CA) was used to measure the fluorescence intensities at the target 
locations using the DEARRAY software (16). After background subtraction, 
average intensities at each clone in the test hybridization were divided by ihc 
average intensity of the corresponding clone in the control hybridization. For 
the copy number analysis, the ratios were normalized on the basis of the 
distribution of ratios of all targets on the array and for the expression analysis 
on the basis of 8A housekeeping genes, which were, spoiled four times onto the 
arruv. Low quality measurements (i.e.. copy number data with mean reference 
intensity <I00 fluorescent units, and expression data with both test and 
reference intensity <10() fluorescent units ami/or with spot size <50 units) 
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were excluded from the analysis and were treated as missing values. The 
distributions of fluorescence ratios were used to define curpoinls for increased/ 
decreased copy number. Genes with CGH ratio >1.43 (representing the upper 
5% of the CGH ratios across all experiments) were considered to be amplified, 
and genes with ratio <0.73 (re presenting the lower 5%) were considered to be 
deleted. 

Statistical Analysis of CC'II and cDNA Microarray Data. To evaluate 
the influence of copy number alterations on gene expression, we applied the 
following statist i cat approach. CGH and cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were 
median centered. For each gene, the CGH data were represented by a vector 
that was labeled 1 for amplification (ratio, >1.43) and 0 for no amplification. 
Amplification was correlated with gene expression using the signal-to-noise 
statistics (I). We calculated a weight. w r for each gene as follows: 

m H i " niyo 

where m p) . <r rl and m^, ^ denote the means and SDs for the expression 
levels for amplified and nonampliftcd cell lines, respectively. To assess the 
statistical significance of each weight, we performed 10,000 random permu- 
tations of the label vector. The probability that a gene had a larger or equal 
weight by random permutation than die original weight was denoted by nr.. A 
low a (<0.05) indicates a strong association between gene expression and 
amplification. 

Genomic Localization of cDNA Clones and Amplieon Mapping. Each 
cONA clone on the microarray was assigned to a Unigene cluster using the 
Unigene Build 14).° A database of genomic sequence alignment information 
for in RNA sequences was created from the August 2001 freeze of the Uni- 
versity of California Santa Cruz's GoldenPulh database, 7 The chromosome and 
bp positions for each cl)NA clone were then retrieved by relating these data 
sets. Amplicons were defined as a CGH copy number ratio >2.0 in at least two 
adjacent clones in two or more cell lines or a CGH ratio >2.0 in at least three 
adjacent clones in a single cell line. The amplieon stan and end positions were 



" Iniernei address: hup•y/rt^scarc^l.nllgri.ni)^yov/nlic^o^t^ray/downloa^lahlt;^cd^a.ht^l!. 
' Internet address: www.gcnomc.ucse.edu. 
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Tabic I Summary of independent amplicons in 14 breast cantvr cell tines hy 
CGH microarray 



Location 



lp)3 
lq2l 
lq22 
3pl4 

7pl2.1-7pH.2 

7q31 

7q32 

8q21.li-8q21.l3 
8q2 1 .3 

■ 8q23.3-8q24.l4 
8q24.22 
9 P 13 

Ug22-<i3l 
I6q22 
I7q! 1 

17ql2^q2l.2 
17q21.32-q2 
I7q22-q23.3 
I7q23.3-<j24.3 
19ql3 
2Ck4l 1.22 
20ql3.l2 
,20ql3. 12-^13.13 
20ql3.2-ql3.32 



33 



Sian (Mb) 



132.79 
173.92 
179.28 
71.94 
55.62 
125.73 
(40.01 
86.45 
98.45 
129.88 
151.2! 
38.65 
77.15 
86.70 
29.30 
39.79 
52.47 
63.81 
69.93 
4(1.63 
34.5y 
44.00 
46.45 
51.32 



End (Mb) 



132.94 
177.25 
179.57 
74.66 
60.95 
(30.96 
140.68 i 
92.46 
103.05 
142.15 
152.16 
39.25 
81.38 
87.62 
30.85 
42.80 
55.80 
69.70 
74.99 
4 1.40 
35.85 
45.62 
49.43 
59. 1 2 



Size (Mb) 



0.2 
3.3 
0.3 
2.7 
5.3 
5.2 
0.7 
6.0 
4.6 
12.3 
1.0 
0.6 
4.2 
0.9 
1.6 
3.0 
3.3 
5.9 
5.1 
0.8 
1.3 
1.6 
3.0 
7.8 



CGH were validated,, with lq21, I7ql2-q2l.2. 17q22~q23, 20ql3.I, 
and 20ql3.2 regions being most commonly amplified. Furthermore, 
the boundaries of these amplicons were precisely delineated. In ad- 
dition, novel amplicons were identified at 9p!3 (38.65-39.25 Mb), 
and 17q21.3 (52.47-55.80 Mb). 

Direct Identification of Putative Amplification Target Genes, 
The cDNA/CGH microarray technique enables the direct correla- 
tion of copy number and expression data on a genc-by-gene basis 
throughout the genome. We directly annotated high-resolution 
CGH plots with gene expression data using color coding. Pig. 2C 
shows that most of the amplified genes in the MCK-7 breast cancer 
cell line at Ipl3, 17q22-q23, and 20ql3 were highly overex- 
pressed. A view of chromosome 7 in the MDA-468 cell line 
implicates EGFR us the most highly overexpressed and amplified 
gene at 7pl !--pl2 (Fig. 3/1). In BT-474, the two known amplicons 
at 17ql2 and I7q22-q23 contained numerous highly overex- 
pressed genes (Fig. 3B). In addition, several genes, including the 
homeobox genes HOXB2 and HOXB7, were highly amplified in a 
previously undesenbed independent amplicon at I7q2!.3. HOXB7 
was systematically amplified (as validated by FISH, Fig. 3#. inset) 
as well as overexpressed (as verified by RT-PCR, data not shown) 
in HT-474. UACC812. and ZR -75-30 cells. Furthermore, this novel 




extended to include neighboring nonamplified clones (ratio. <l.5). The am- 
plicon size determination was partially dependent on local clone density. 

FISH. Dual-color interphase FISH to breast cancer cell lines was done as 
described (17). Bacterial artificial chromosome clone RP1I-361K8 was la- 
beled with SpectrumOrangc {Vysis. Downers Grove, IL), and Spectrum- 
Orange-labeled probe for EGFR was obtained 'from Vysis. SpectrumGrecn- 
labeled chromosome 7 and 17 centromere probes • (Vysis) were used as a 
reference. A tissue microarray containing 612 formalin-fixed, paraffin -embed- 
ded primary breast cancers (17) was applied in FISH analyses as described 
(1 8). The use of these specimens was approved by the Ethics Committee of the 
University of Base! and by the NIH. Specimens containing a 2-fold or higher 
increase in the number of test probe signals, as compared with corresponding 
centromere signals, in at least 10% of ihe tumor ceils were considered to be 
amplified. Survival analysis was performed using the Kaplan-Meier method 
and the log-rank test. 

RT-PCR. The HOXB? expression level was determined relative to 
GAPDH, Reverse transcription and PCR amplification wen; performed using 
Access RT-PCR System (Promega Corp., Madison, Wl) with 10 ng of mRNA 
as a template. HOXB 7 primers were 5'-GAGCAGAGGGACTCGGACTT-.V 
and 5 ' -GCGTCAGGTAGCG A TTGT AG- J '. 

RESULTS * 

Global Effect of Copy Number on Ccne Expression. 13,824 
arrayed cDNA" clones were applied for analysis of gene expression 
and gene copy number (CGH microarray s) in 14 breast cancer cell 
lines. The results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2.5) were overexpressed (i.e.. belonged to 
the global upper 7% of expression ratios), compared with only 6% for 
genes with normal copy number levels (Fig. IA). Conversely, 10.5% 
of the transcripts with high-level expression (cDNA ratio. >10) 
showed increased copy number (Fig. \B). Low-level copy number 
increases and decreases were also associated with similar, although 
less dramatic, outcomes on gene expression (l ; ig. !)■ 

Identification of Distinct Breast Cuneer Amplicons. Base-pair 
locations obtained for 1 1,994 cDNAs (86.8%) were used to plot copy 
number changes as a function of genomic position (Fig. 2, Supple- 
ment Fig. A). The average spacing of clones throughout the genome 
was 267 kb. This high-resolution mapping, identified 24 independent 
breast cancer amplicons, spanning from 0.2 to 12 Mb of DNA (Table 
I), Several amplification sites detected previously by chromosomal 
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l ; ig. ~S. Annotation of gene expression dutH on CGH microarray profiles. A, genes in the 
7p 1 1 -pi 2 omplicon in the; MDA-4G8 celt tine arc highly expressed (redouts) nnd include 
the EGFR oncogene. B, several genes in the I7ql2. 1 7q2 i .3, and 170,23 amplicons in the 
[VM74 breast cancer cell line ure highly overexpressed {red) and include the HOXB? 
gene, The data labels and color coding ;irc as indicated for Fig.. 2C insets show 
chromosomal CGH profiles for the corresponding chromosomes and validation of the 
increased copy numher by interphase FISH (ising EGFR ired) and chromosome 7 
cemromcrc probe i^reen) to MDA-46S {A) and HOXB 7-spcci fie probe \red) and chro- 
mosome 17 centromere {green) to BT-474 cells (B). 
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Fig. 4. List of SO genes with a statistically 
significant correlation (« value <0.05) between 
gene copy number and gene expression. Name, 
chromosomal location, and the a value for each 
gene arc indicated. The genes have been ordered 
according to their position in the genome. The color 
maps on the right illustrate the copy number and 
expression ratio patterns in the 14 cell lines. The 
key to the color code is shown at the bottom of the 
graph. Gray squares, missing values. The complete 
list of 270 genes is shown in supplemental Fig. B. 
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amplification was validated to be present in 10.2% of 363 primary 
breast cancers by FISH to a tissue microarray and was associated 
with poor prognosis of the patients (P = 0.001). 

Statistical Identification and Characterization of 270 Highly 
Expressed Genes in Amplicons. Statistical comparison of expres- 
sion levels of all genes as a function of gene amplification identified 
270 genes whose expression was significantly influenced by copy 
number across al! 14 cell lines (Fig. 4, Supplemental Fig. B). Accord- 
ing to the gene ontology data/ 91 of the 270 genes represented 
hypothetical proteins or genes with no functional annotation, whereas 
179 had associated functional information available. Of these, 151 
(84%) arc implicated in apoptosis. cell proliferation, signal transduc- 
tion, and iranseriplion. whereas 28 (16%) had functional annotations 
that could not be directly linked with cancer. 



DISCUSSION 

The importance of recurrent gene and chromosome copy number 
changes in the development and progression of solid tumors has been 
characterized in >100() publications applying CGH 9 (9, 10), as well 
as in a large number of other molecular cytogenetic, cytogenetic, and 
molecular genetic studies. The effects of these somatic genetic 
changes on gene expression levels have remained largely unknown, 
although a few studies have explored gene expression changes occur- 
ring in specific amplicons (15, 19-21). Here, we applied genome- 
wide cDNA microarrays to identify transcripts whose expression 
changes were attributable to underlying gene copy number alterations 
in breast cancer. 

The overall impact of copy number on gene expression patterns was 
substantial with the most dramatic effects seen in the case of high- 
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level copy number increase. Low-level copy number gains and losses 
also had a significant influence on expression levels of genes in the 
regions affected, but these effects were more subtle on a gene-by-gene 
basis than those of high-level amplifications.- However, the impact of 
low-level gains on the dysregulation of gene expression patterns in 
cancer may be equally important if not more important than that of 
high-level amplifications. Ancuploidy and low-level gains and losses 
of chromosomal arms represent the most common types of genetic 
alterations in breast and other cancers and, therefore, have an influ- 
ence on many genes. Our results in breast cancer extend the recent 
studies on the impact of aneuploidy on global gene expression pat- 
terns in yeast cells, acute myeloid leukemia, and a prostate cancer 
model system (22-24). 

The CGH microarray analysis identified 24 independent breast 
cancer amplicons. We defined the precise boundaries for many arn- 
" plicons detected previously by chromosomal CGH (9, 10, 25, 26) and 
also discovered novel amplicons that had not been detected previ- 
ously, presumably because of their small size (only 1-2 Mb) or close 
proximity to otheT larger amplicons. One of these novel amplicons 
involved the homeobox gene region at I7q21.3 and led to the over- 
expression of the HOXB7 and HOXB2 genes. The homeodomain 
transcription factors are known to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 transfection induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
tumorigenicity and angiogenesis in breast cancer (29-32). The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexpressing HOXB7 in breast cancer and suggest that 
HOXB7 contributes to tumor progression and confers an aggressive 
disease phenotype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification with poor prognosis 
of the patients. 

We carried out a systematic search to identify genes whose 
expression levels across all 14 cell lines were attributable to 
amplification status. Statistical analysis revealed 270 such genes 
(representing -2% of all genes on the array), including not only 
previously described amplified genes, such as HER-2, MYC, 
EGFR, ribosomal protein s6 kinase, and AJB3, but also numerous 
novel genes such as NRAS-related gene (lpl 3), syndecan-2 (8q22), 
and bone morphogenic protein (20ql3.l), whose activation by 
amplification may similarly promote breast cancer progression. 
Most of the 270 genes have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of them to be causally 
involved, it is intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in- cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biological insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

)n summary, wc demonstrate application of cDNA micronrrays 
to the analysis of both. copy number and expression levels of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: {a) evidence' of a 
prominent global influence of copy number changes on gene 
expression levels; (l>) a high-resolution map of 24 independent 
amplicons in breast cancer; and (c) identification of a set of 270 
genes, the overexpression of which was statistically attributable to 
gene amplification. Characterization of a novel arnplicon at 
17q21.3 implicated amplification and overexpression of the 
HOXB7 gene in breast cancer, including a clinical association 
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between HOXB7 amplification and poor patient prognosis. Overall, 
our results illustrate how the identification of genes activated by 
gene amplification provides a powerful approach to highlight 
genes with an important role in cancer as well as to prioritize and 
validate putative targets for therapy development. 
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Genomic DNA copy number alterations are key genetic events in 
the development and progression of human cancers. Here we 
report a genome-wide microarray comparative genomic hybrid- 
ization (array CGH) analysis of DNA copy number variation in 
a series of primary human breast tumors. We have profiled DNA 
copy number alteration across 6,691 mapped human genes, in 44 
predominantly advanced, primary breast tumors and 10 breast 
cancer cell lines. White the overall patterns of DNA amplification 
and deletion corroborate previous cytogenetic studies, the high- 
resolution (gene-by-gene) mapping of amplicon boundaries and 
the quantitative analysis of amplicon shape provide significant 
improvement in the localization of candidate oncogenes. Parallel 
microarray measurements of mRNA levels reveal the remarkable 
degree to which variation in gene copy number contributes to 
variation in gene expression in tumor cells. Specifically, we find 
that 62% of highly amplified genes show moderately or highly 
elevated expression, that DNA copy number influences gene ex- 
pression across a wide range of DNA copy number alterations 
(deletion, low-, mid- and high-level amplification), that on average, 
a 2-fold change in DNA copy number is associated with a corre- 
sponding 1.5-fold change in mRNA levels, and that overall, at least 
12% of all the variation in gene expression among the breast 
tumors is directly attributable to underlying variation in gene copy 
number. These findings provide evidence that widespread DNA 
copy number alteration can lead directly to global deregulation of 
gene expression, which may contribute to the development or 
progression of cancer, 

Conventional cytogenetic techniques, including comparative 
genomic hybridization (CGH) (1), have led to the identifi- 
cation of a number of recurrent regions of DNA copy number 
alteration in breast cancer cell lines and tumors (2-4). While 
some of these regions contain known or candid ate oncogenes 
[e.g.. FGFR1 (Spll). MYC (Sq24), CCNDI (Mql3). ERBB2 
(J7ql2). and ZNF217 (20ql3)j and tumor suppressor genes 
[RBI (13ql4) and TP53 (I7pl3)j, the relevant gene(s) within 
other regions (e.g., gain of lq. 8q22, and I7q22-24 ; and loss of 
8p) remain to be identified. A high-resolution genome-wide- 
map, delineating the boundaries of DNA copy number alter- 
ations in tumors, should facilitate the localization and identifi- 
cation of oncogenes and tumor suppressor genes in" breast 
cancer. In this study, we have created such a map, using 
array-based CGH (5-7) to profde DNA copy number alteration 
in a series of breast cancer cell lines and primary tumors. 

An unresolved question is the extent to which the widespread 
DNA copy number changes that we and others have identified 
in breast tumors alter expression of genes within involved 
regions. Because we had measured mRNA levels in parallel in 
the same samples (8), using the same DNA microarrays, we had 
an opportunity to explore on a genomic scale the relationship 
between DNA copy number changes and gene expression. From 



this analysis, we have identified a significant impact, of wide- 
spread DNA copy number alteration on the transcriptional 
programs of breast tumors. 

Materials and Methods 

Tumors and Cell lines. Primary breast tumors were predominantly 
large (>3 cm), intermediate-grade, infiltrating ductal carcino- 
mas, with more than 50% being lymph node positive. The 
fraction of tumor cells within specimens averaged at least 50%. 
Details of individual tumors have. been published (8, 9), and 
arc summarized in Table 1, which is published as supporting 
information on the PNAS web site, www.pnas.org. Breast cancer 
cell lines were obtained from the American Type Culture 
Collection. Genomic DNA was isolated either using Qiagen 
genomic DNA columns, or by phenol/chloroform extraction 
followed by ethanol precipitation. 

DNA Labeling and Microarray Hybridizations. Genomic DNA label- 
ing and hybridizations were performed essentially as described 
inlPollack et til (7). with slight modifications. Two micrograms 
of DNA was labeled in a total volume of 50 microliters and the 
volumes of all reagents were adjusted accordingly. "Test" DNA 
(from tumors and cell lines) was fluoresce ntly labeled (Cy5) and 
hybridized to a human cDNA microarray containing 6,691 
different mapped human genes (i.e., UniGene clusters). The 
"reference" (labeled with Cy3) for each hybridization was nor- 
mal female leukocyte DNA from a single donor. The fabrication 
of cDNA microarrays and the labeling and hybridization of 
mRNA samples have been described (8). 

Data Analysis and Map Positions. Hybridized arrays were scanned 
on a GenePix scanner (Axon Instruments, Foster City, CA), and 
fluorescence ratios (test/reference) calculated using sc analyze 
software (available at http://rana.lbl.gov). Fluorescence ratios 
were normalized for each array by setting the average log 
fluorescence ratio for all array elements equal to. 0. Measure- 
ments with fluorescence intensities more than 20% above back- 
ground were considered reliable. DNA copy number profiles 
that deviated significantly from background ratios measured in 
normal genomic DNA control hybridizations were interpreted as 
evidence of real DNA copy number alteration (see Estimating 
Significance of Altered Fluorescence Ratios in the supporting 
information). When indicated, DN*A copy number profiles are 
displayed as a moving average (symmetric 5-nearest neighbors). 
Map positions for arrayed human cDNAs-were assigned by 
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Fig. 1 . Genome-wide measurement of DNA copy number alteration by array CGH. (a) DNA copy number prof. Ics are illustrated for cell lines con taming different 
numbers of X chromosomes, for breast cancer cell lines, and for breast tumors. Each row represents a different cell line or tumor, and each column represents 
one of 6,69 1 different mapped human genes present on the microarray, ordered by genome map position from 1 pter through Xqter. Moving average (symmetr.c 
5-nearest neighbors) fluorescence ratios (test/reference) are depicted using a logrbased pseudocolor scale (indicated), such that red luminescence reflects 
fold-amplification, green luminescence reflects fold-deletion, and black indicates no change (gray indicates poorly measured data), {b) Enlarged view ot DNA 
copy number profiles across the X chromosome, shown for cell lines containing different numbers of X chromosomes. 



identifying the starting position of the best and longest match of 
any DNA sequence represented in the corresponding UniGene 
cluster (10) against the "Golden Path" genome assembly 
(http://genome.ucsc.edu/; Oct 7, 2000 Freeze), for UniGene 
clusters represented by multiple arrayed elements, mean fluo- 
rescence ratios (for all elements representing the same UniGene 
cluster) are reported. For mRNA measurements, fluorescence 
ratios are "mean-centered" (i.e., reported relative to the mean 
ratio across the 44 tumor samples). The data set described here 
can be accessed in its entirety in the supporting information. 

Results 

We performed CGH on 44 predominantly locally advanced, 
primary breast tumors and 10 breast cancer cell lines, using 
cDNA microarrays containing 6,691 different mapped human 
genes (Fig. la; also see Materials and Methods for details of 
microarray hybridizations). To take full advantage of the im- 
proved spatial resolution of array CGH, we ordered (fluores- 
cence ratios for) the 6,691 cDNAs according to the "Golden 
Path" (http://genome.ucsc.edu/) genome assembly of the draft 
human genome sequences (11). In so doing, arrayed cDNAs not 
only themselves represent genes of potential interest (e.g., 
candidate oncogenes within amplicons), but also provide precise 
genetic landmarks for chromosomal regions of amplification and 



deletion. Parallel analysis of DNA from cell lines containing 
different numbers of X chromosomes (Fig. 1£>), as we did before 
(7), demonstrated the sensitivity of our method to detect single- 
copy loss (45, XO), and 1.5- (47,XXX) ; 2- (48,XXXX), or 
2.5-fold (49.XXX.XX) gains (also see Fig. 5, which is published 
as supporting information on the PNAS web site). Fluorescence 
ratios were linearly proportional to copy number ratios, which 
were slightly underestimated, in agreement with previous ob- 
servations (7). Numerous DNA copy number alterations were 
evident in both the breast cancer cell lines and primary tumors 
(Fig. la), detected in the tumors despite the presence of euploid 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower in the tumor samples. DNA copy-number 
alterations were found in every cancer cell line and tumor, and 
on every human chromosome in at least one sample. Recurrent 
regions of DNA copy number gain and loss were readily iden- 
tifiable. For example, gains within lq, 8q, 17q, and 20q were 
observed in a high proportion of breast cancer cell lines/tumors 
(90%/69%, IOO%/47%, 100%/6()%, and 90%/44%, respective- 
ly), as were losses within ip, 3p. 8p, and 13q {80%/24%, 
H()%/22%, 80*&/22%\ and 70%/ 18%, respectively), consistent 
with/published cytogenetic studies (refs. 2-4; a complete listing 
of gains/losses is provided in Tables 2 and 3, which are published 
as supporting information on the FN AS web site). The total 
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" Fig 2 DNA copy number alteration across chromosome 8 by array CGH. (a) DNA copy number profiles are illustrated for cell lines containing different numbers 
o1 X chromosomes for breast cancer cell lines, and for breast tumors. Breast cancer cell lines and tumors are separately ordered by hierarchtcal clustering to 
highlight recurrent copy number changes. The 241 genes present on the microarrays and mapping to chromosome 8 are ordered by position along the 
chromosome Fluorescence ratios (test/reference) are depicted by a log; pseudocolor scale (indicated). Selected genes are indicated with color-coded text (red, 
increased- green decreased; black, no change; gray, not well measured) to reflect correspondingly altered mRNA levels (observed <n the majority of the subset 
of samples displaying the DNA copy number change). The map positions for genes of interest that are not represented on the rmcroarray are .nd.cated in the 
row above those genes represented on the array, (b) Graphical display of DNA copy number profile for breast cancer cell line SKBR3. Fluorescence ratios 
(tumor/normal) are plotted on a log? scale for chromosome 8 genes, ordered along the chromosome. 



number of genomic alterations (gains and losses) was found to 
be significantly higher in breast tumors that were high grade {P - 
0,008), consistent with published CGH data (3), estrogen recep- 
tor negative (P = 0,04), and harboring TP53 mutations (P - 
0.0006) (see Table 4, which is published as supporting informa- 
tion on the FN AS web site). 

The improved spatial resolution of our array CGH analysis is 
illustrated for chromosome 8, which displayed extensive DNA 
copy number alteration in our series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously 
uncharacterized oncogene (Fig. 2a). The complexity of amplicon 
structure is most easily appreciated in the breast cancer cell line 
SKBR3. Although a conventional CGH analysis of 8q in SKBR3 
identified only two distinct regions of amplification (12), we 
observed three distinct regions of high-level amplification (la- 
beled 1-3 in Fig. 2b). For each of these regions we can define the 



boundaries of the interval recurrently amplified in the tumors we 
examined; in each case, known or plausible candidate oncogenes 
can be identified (a description of these regions, as well as the 
recurrently amplified regions on chromosomes 17 and 20, can be 
found in Figs. 6 and 7, which are published as supporting 
information on the PNAS.web site). 

For a subset of breast cancer cell lines and tumors (4 and 37, 
respectively), and a subset of arrayed genes (6,095), mRNA 
levels were quantitatively measured in parallel by using cDNA 
microarrays (8). The parallel assessment of mRNA levels is 
useful in the interpretation of DNA copy number changes. For 
example, the highly amplified genes that arc also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
eo n. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor cells. 

A strong influence of DNA copy number on gene expression 
is evident in an examination of the pseudocolor representations 
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Fig. 3. Concordance between DNA copy number and gene expression across chromosome 17. DNA copy number alteration {Upper and mRNA '^Mtovver) 
are illustrated lor breast cancer cel. lines and tumors. Breast cancer cel. .ines and tumors are separately ordered * 

identical sample order is maintained (Lower). The 354 genes present on the microarravs and mapp.ng to chromosome 1 7. and for ^f^)^^^ 
and mRNA levels were determined, are ordered by position along the chromosome; selected genes are indicated .n color-coded text {see F.g. 2 legend). 
Fluorescence ratios (test/reference) are depicted by separate iog 2 pseudocolor scales (indicated). 



of DNA copy number and mRNA levels for genes on chromo- 
some 17 (Fig. 3). The overall patterns of gene amplification and 
elevated gene expression arc quite concordant; i.e., a significant 
fraction of highly amplified genes appear to be correspondingly 
highly expressed. The concordance between high-level amplifi- 
cation and increased gene expression is not restricted to chro- 
mosome 17. Genome-wide, of 117 high-level DNA amplifica- 
tions (fluorescence ratios >4, and representing 91 different 
genes), 62% (representing 54 different genes; see Table 5, which 
is published as supporting information on the PNAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-centered fluorescence ratios >2), and 42% (rep- 
resenting 36 different genes) are found associated with compa- 
rably highly elevated mRNA levels (mean-centered fluorescence 
ratios >4). 

To determine the extent to which DNA deletion and lower- 
level amplification (in addition to high-level amplification) are 
also associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 cell lines and 37 tumors, across 6,095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low-, 
medium-, and high-level amplification (Fig. 4d)- For both the 



breast cancer cell lines and tumors, average mRNA levels 
tracked with DNA copy number across all five classes, in a 
statistically significant fashion (/ J values for pair-wise Students 
/ tests comparing adjacent classes: cell tines, 4 X 10" 19 , 1 X \0~ A , 

.5 x irr 5 , i x io-2; tunuirs , ] x 10 -4\ i x ict 214 . 5 x irr 41 . 

1 x tO'" 4 ). A linear regression of the average log(DNA copy 
number), for each class, against average log(mRNA level) 
demonstrated that on average, a 2-fold change in DNA copy 
number was accompanied by 1.4- and 1.5-fold changes in mRNA 
level for the breast cancer cell lines and tumors, respectively (Fig. 
4tf, regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy num- 
ber and mRNA level, each across the 37 tumor samples (Fig. 4b). 
The distribution of correlations forms a normal-shaped curve, 
bin with the peak markedly shifted in the positive direction from 
zero. This shift is statistically significant, as evidenced in a plot 
of observed vs. expected correlations (Fig. 4c), and reflects a 
pervasive global influence of DNA copy number alterations on 
gene expression. Notably, the highest correlations between DNA 
copv number and mRNA level (the right tail of the distribution 
in Fig. 4b) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation measured in mRNA levels among the 37 
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Fig 4 Genome-wide influence of DNA copy number alterations on mRNA levels, (a) For breast cancer cell lines (gray) and tumor samples (black), both 
. mean-centered mRNA fluorescence ralio (log, scale) quartiles (box plots indicate 25th, 50th, and 75th percentile) and averages (diamonds; /-value error bars 
indicate standard errors of the mean) are plotted for each of five classes of genes, representing DNA deletion (tumor/normal ratio < 0.8), no change (0.8-1.2), 
low- (1 2-2) medium- (2- 4) and high-level (>4) amplification. P values for pair-wise Student's f tests, comparing averages between adjacent classes (mov.ng 
left to right) are4 x 10"» 1 * 1<r« S x 10"\ 1 * 10-' (cell lines), and 1 x io~« 1 x i<r»\ 5 x 1 x«i<r* (tumors), (b) Distribution of correlates between 

DNA copy number and mRNA levels, for 6,095 different human genes across 37 breast tumor samples, (c) Plot of observed versus expected correlation coefficients. 
The expected values we.e obtained by randomization of the sample labels in ihe DNA copy number data set. The line of unity is moicated. (d) Percent variance 
in gene expression (among tumors) directly explained by variation in gene copy number. Percent variance explained (black line) and fraction of data retained 
(gray line) are plotted for different fluorescence intensity/background (a rough surrogate for signal/noise) cutoff values. Fract.on of data retained is relat.ve 
to the 1 2 intensity /background cutoff. Details of the linear regression model used to estimate the fraction of variation in gene expression attributable to 
underlying DNA copy number alteration can be found in the supporting information (see estimating the Fraction of Variation in Gene fxpress/on Attnbutable 
to Underlying DNA Copy Number Alteration). 



tumors that could be attributed to underlying variation in DNA 
copy number. From this analysis, we estimate that, overall, about 
1% of all of the observed variation in mRNA levels can be 
explained directly by variation in copy number of the altered 
genes (Fig. Ad). We can reduce the effects of experimental 
measurement error on this estimate by using only that fraction 
of the data most reliably measured (fluorescence intensity/ 
background >3); using that data, our estimate of the percent 
variation in mRNA levels directly attributed to variation in gene 
copy number increases to 12% (Fig. Ad). This still undoubtedly 
represents a significant underestimate, as the observed variation 
in global gene expression is affected not only by true variation in 
the expression programs of the tumor cells themselves, but also 
by the variable presence of non-tumor cell types within clinical 
samples. 

Discussion 

This genome-wide, array CGH analysis of DNA copy number 
alteration in a series of human breast tumors demonstrates the 
usefulness of defining amplicon boundaries at high resolution 
(gene-by-gene), and quantitatively measuring amplicon shape, to 
assist in locating and identifying candidate oncogenes. By ana- 
lyzing mRNA levels in parallel, we have also discovered that 
changes in DNA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 



cell lines and tumors. Although the DNA microarrays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we are examining such a large fraction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we are likely underestimating the 
contribution of DNA copy number changes to altered gene 
expression, we believe our findings are likely to be generalizable 
(but would nevertheless still be remarkable if only applicable to 
this set of -6,100 genes). 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13). Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene expression in cancer cells. In 
agreement' with our findings, Phillips et at. (14) have shown that 
with the acquisition of tumorigenicity in an immortalized pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a statistically significant respective increase and 
decrease in the average expression level of involved genes. In 
contrast, PUitzei at at. (15) recently reported that in metastatic 
colon tumors only -4% of genes within amplified regions were 
found more highly (> 2-fold) expressed, when compared with 
normal colonic epithelium. This report differs substantially from 
our finding that 62% of highly amplified genes in breast cancer 
exhibit at least 2-fold increased expression. These contrasting 
fintlings may reflect 'methodological differences between the 
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studies. For example, the study of Platzcr et at, (15) may have 
systematically under-measured gene expression changes. In this 
regard it is remarkable thai only 14 transcripts of many thousand 
residing within unamplified chromosomal regions were found to 
exhibit at least 4-fold altered expression in metastatic colon 
cancer. Additionally, their reliance on lower-resolution chromo- 
somal CGH may have resulted in poorly delimiting the bound- 
aries of high-complexity amplicons, effectively overcalling re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNA copy number alteration has 
a large, pervasive and direct effect on global gene expression 
patterns in breast cancer has several important implications. 
Hirst, this finding supports a high degree of copy number- 
dependent gene expression in tumors. Second, it suggests that 
most genes are not subject to specific autoregulation or dosage 
compensation. Third, this finding cautions that elevated expres- 
sion of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in tumor- 
igenesis. In our study, fully 62% of highly amplified genes 
demonstrated moderately or highly elevated expression. This 
highlights the importance of high -resolution mapping of ampli- 
con boundaries and shape [to identify the "driving" gene(s) 
within amplicons (16)], on a large number of samples, in addition 
to functional studies. Fourth, this finding suggests thai analyzing 
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the genomic distribution of -expressed genes, even within existing 
microarray gene expression data sets, may permit the inference 
of DNA copy number aberration, particularly a neuploidy (where 
gene expression can be averaged across large chromosomal 
regions; see Fig. 3 and supporting information). Fifth, this 
finding implies that a substantial portion of the phenotypic 
uniqueness (and by extension, the heterogeneity in clinical 
behavior) among patients* tumors may be traceable to underly- 
ing variation in DNA copy number. Sixth, this finding supports 
a possible role for widespread DNA copy number alteration in 
tumorigenesis (17, 18), beyond the amplification of specific 
oncogenes and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread imbalance in gene expression,' might' disrupt critical 
stochiometric relationships in cell metabolism and physiology 
(e.g., proteosome, mitotic spindle), possibly promoting further 
chromosomal instability and directly contributing to tumor 
development or progression. Finally, our findings suggest the 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression in cancer. 
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Each year, over 182,000 women in the United States are 
diagnosed with breast cancer, and approximately 45,000 die 
of the disease. 1 Incidence appears to be increasing in the 
United States at a rate of roughly 2% per yean The reasons 
for the increase are unclear, but non-genetic risk factors appear 
to play a large role. 2 

Five-year survival rates range from approximately 65%- 
85%, depending on demographic group, with a significant 
percentage of women experiencing recurrence of their cancer 
within 10 years of diagnosis. One of the factors most predic- 
tive for recurrence once a diagnosis of breast cancer has been 
made is the number of axillary lymph nodes to which rumor 
has metastasized. Most node-positive women are given adju- 
vant therapy, which increases their survival. However, 20%- 
30% of patients without axillary node involvement also 
develop recurrent disease,.and the difficulty lies in how to iden- 
tify this high-risk subset of patients. These patients could 
benefit from increased surveillance, early intervention, and 
treatment. 

Prognostic markers currently used in breast cancer recur- 
rence prediction include rumor size, histological grade, steroid 
hormone receptor status, DNA ploidy, proliferative index, and 
cathepsin D status. Expression of growth factor receptors and 
over-expression of the HER-2/neu oncogene have also been 
. identified as having value regarding treatment regimen and 
prognosis. 

HER-2/neu (also known as c-erbB2) is an oncogene that 
encodes a transmembrane glycoprotein that is homologous 
to, but distinct from, the epidermal growth factor receptor. 
Numerous studies have indicated that high levels of expres- 
sion of this protein are associated with rapid tumor growth, 
certain forms of therapy resistance, and shorter disease-free 
survival. The gene has been shown to be amplified and/or 
overexpressed in 10%-30% of invasive breast cancers and in 
40%-60% of intraductal breast carcinoma. 3 

There are two distinct FDA-approved methods by which 
HER-2/neu status can be evaluated: immunohistochemistry 
(1HC, HercepTest™) and FISH (fluorescent in situ hybridiza- 
tion, PathVysion™ Kit). Both methods can be performed on 
archived and current specimens. The first method allows visual 
assessment of the amount of HER-2/neu protein present on 
the cell membrane. The latter method allows direct quantifi- 
cation of the level of gene amplification present in the tumor, 
enabling differentiation berween low- versus high-amplifica- 
tidh. At least one study has demonstrated a difference in 



recurrence risk' in women younger than 40 years of age for 
low- versus high-amplified tumors (54.5% compared to 
85.7%); this is compared to a recurrence rate of 16.7% for 
patients with no HER-2/neu gene amplification. 4 HER-2/neu 
status may be particularly important to establish in women with 
small (< 1 cm) rumor size. 

The choice of methodology for determination of HER-2/ 
neu status depends in part on the clinical setting. FDA approval 
for the Vysis FISH test was granted based on clinical trials 
involving 1549 node-positive patients. Patients received one 
of three different treatments consisting of different doses of 
cyclophosphamide, Adriamycin, and 5-fiuorouracil (CAF). 
The study showed that patients with amplified HER-2/neu 
benefited from treatment with higher doses of adriamycin- 
based therapy, while those with normal HER-2/neu levels did 
not. The study therefore identified a sub-set of women, who 
because they did not benefit from more aggressive treatment, 
did not need to be exposed to the associated side effects. In 
addition, other evidence indicates that HER-2/neu amplifica- 
tion in node-negative patients can be used as an independent 
prognostic indicator for early recurrence, recurrent disease at 
any time and disease-related death. 5 Demonstration of HER- 
2/neu gene amplification by FISH has also been shown to be 
of value in predicting response to chemotherapy in stage-2 
breast cancer patients. 

Selection of patients for Herceptin® (Trastuzumab) mono- 
clonal antibody therapy, however, is based upon demonstra- 
tion of HER-2/neu protein overexpression using HercepTest™. 
Studies using Herceptin® in patients with metastatic breast 
cancer show an increase in time to disease progression, 
increased response rate to chemotherapeutic agents and a small 
increase in overall survival rate. The FISH assays have not yet 
been approved for this purpose, and studies looking at response 
to Herceptin© in patients with or without gene amplification 
status deterfnined by FISH are in progress. 

In general, FISH and IHC results correlate well. However, 
subsets of tumors are found which show discordant results; 
i.e., protein overexpression without gene amplification or lack 
of protein overexpression with gene amplification. The clini- 
cal significance of such results is unclear. Based on the above 
considerations, HER-2/neu testing at SHMC/PAML will uti- 
lize immunohistochemistry (HercepTest©) as a screen, fol- 
lowed by FISH in IHC-negative cases. Alternatively, either 
method may be ordered individually depending on the clini- 
cal setting or clinician preference. 



CPT code information 

HER-2/neu via IHC 

88342 (including interpretive report) 

HER-2/neu via FISH . 

88271 *2 Molecular cytogenetics, DNA probe, each 
88274 Molecular cytogenetics, interphase in situ hybrid- 
ization, analyze 25-99 cells 
88291 Cytogenetics and molecular cytogenetics, interpre- 
tation and report 

Procedural Information 

Immunohistochemistry is performed using the FDA-approved 
DAKO antibody kit, Herceptest®. The DAKO kit contains 
reagents required to complete a two-step immunohisto- 
chemical staining procedure for routinely processed, paraffin- 
embedded specimens. Following incubation with the primary 
rabbit antibody to human HER-2/neu protein, the kit employs 
a ready-to-use dextran-based visualization reagent. This re- 
agent consists of both secondary goat anti-rabbit antibody 
molecules with horseradish peroxidase molecules linked to a 
common dextran polymer backbone, thus eliminating the need 
for sequential application of link antibody and peroxidase 
conjugated antibody. Enzymatic conversion ©f the subse- 
quently added chromogen results in formation of visible 
reaction product at the antigen site. The specimen is then coun- 
terstained; a pathologist using light-microscopy interprets 
results. 

FISH analysis at SHMC/PAML is performed using the 
FDA-approved PathVysion™ HER-2/neu DNA probe kit, pro- 
duced by Vysis, Inc. Formalin fixed, paraffin-embedded breast 
tissue is processed using routine histological methods, and then 
slides are treated to allow hybridization of DNA probes to the 
nuclei present in the tissue section. The Pathvysion™ kit con- 
tains two direct-labeled DNA probes, one specific for the 
alphoid repetitive DNA (CEP 1 7, spectrum orange) present at 
the chromosome 17 centromere and the second for the HER- 
2/neu oncogene located at 17q 11. 2- 12 (spectrum green). Enu- 
meration of the probes allows a ratio of the number of copies 
of chromosome 17 to the number of copies of HER-2/neu to 
be obtained; this enables quantification of low versus high 
amplification levels, and allows an estimate of the percentage 
of cells with HER-2/neu'gene amplification. The clinically 
relevant distinction is whether the gene amplification is due 
to increased gene copy number on the two chromosome 17 
homologues normally present or an. increase in the number of 
chromosome 17s in the cells. In the majority of cases, ratio 
equivalents less than 2.0 are indicative of a normal/negative 
result, ratios of 2.1 and over indicate that amplification is 
present and to what degree. Interpretation of this data will be 
performed and reported from the Vysis-certified Cytogenet- 
ics laboratory at SHMC. 



References 

1. Wingo, P. A., Tong, T, Balden, S., ''Cancer Statistics", I995;45:l :8-3 1 . 

2 "Cancer Rates and Risks", A lh cd., National Institutes of Health. National 
Cancer Institute, 1996, p. 120. 

3 Stamon, D.J., Clark, CM., Song, S.G., Levin, W.J., Ullrich, A., McGuire, 
W.L. "Human breast Cancer; Correlation of relapse and survival with 
amplification of the her-2/neu oncogene". Science, 235:177*182, 1987. 

4 Xing, W.R„ Gilchrist, K.W., Harris, CP,, Samson, W.. Meisner, L.F. 
"FISH detection of HER-s/neu oncogene amplification in early onset 
breast cancer". Breast Cancer Res. And Treatment 39(2):203-2 12, 1 996. 

5 Press, M.F. Bernstein, L'., Thomas, P. A., Meisner, L.F., Zhou, J.Y., Ma, 
Y., Hung, G. t Robinson, R.A., Harris, C, El-Naggor, A., Siamon, D.J., 
Phillips, R.N., Ross, J.S., Wolman, S.R., Flom, K.J., "Her-2/neu gene 
amplification characterized by fluorescence in situ hybridization: poor 
prognosis in node-negative breast carcinomas", J. Clinical Oncology 
15(8) 2894-2904, 1997. 



Provided for the clients of 

V A T H O L O O Y ASSOCIATES M P. I) I C A 1- LABORATORIES 

I 5 a c; L a d Network Laboratories 
Tri-Cities Laboratory 
1 ' Treasure Va i. ley Laboratory 

For more information, please contact 
your local representative. 



© 1999 by Pathology Associates Medical Laboratories. 



I 



Proc. Natl. Acad. Scl. USA 
Vol, 83, pp. 4049-4052, June 198fi 
Medical Sciences 



Variable expression of the translocated c-abl oncogene in 
Philadelphia-chromosome-positive B-lymphoid cell lines 
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ABSTRACT The consistent cytogenetic translocation of 
chronic myelogenous leukemia (the Philadelphia chromosome, 
Ph*) has been observed In cetls of multiple hematopoietic 
lineages. Tills translocation creates a chimeric gene composed 
of breakpoint -cluster-region {bar) sequences from chromosome 
22 fused to a portion of the abl oncogene on chromosome 9. The 
resulting gene product (P2I0 c "* bl ) resembles the transforming 
protein of the Abelson murine leukemia virus in its structure 
and tyrosine kinase pctivity. P210 c '* bl Is expressed In Ph 1 '- 
posltive cell lines of myeloid lineage and In clinical specimens 
with, myeloid predominance. We show here that Epstein-Barr 
virus-transformed B-lymphocyte lines that retain Ph 1 can 
express P210 c ** bl , The level of expression in these B-ceH lines Is 
generally lower and more variable than that observed for 
myeloid lines. Protein expression is not related to amplification 
of the abl gene but to variation in the level of bcr-abl mRNA 
produced from a single Ph 1 template, 

Chronic myelogenous leukemia (CML) is a disease of the 
pluripotent stem cell (1). In greater than 95% of patients, the 
leukemic cells contain the cytogenetic marker known as the 
Philadelphia chromosome, or Ph 1 (2), This reciprocal 
translocation event between the long-arms of chromosomes 
9 and 22 has been used as a disease-specific marker for 
diagnosis and evaluation of therapy. Multiple hematopoietic 
lineages, including myeloid and B-lymphoid, contain Ph 1 in 
early or chronic phase, as well as in the more acute accel- 
erated and blast crisis phases of the disease. 

One molecular consequence of Ph* is the translocation of 
the chromosomal arm containing the c-abl gene on chromo- 
some 9 into the middle of the breakpoint-cluster region (bcr) 
gene on chromosome 22 (3-6).' Although the precise 
translocation breakpoints are variable, an RNA-splicing 
mechanism generates a very similar 8-kilobase (kb) mRNA in 
each case (5-9). The hybrid bcr-abl message encodes a 
structurally altered form of the abl oncogene product, called 
P2|QCH>bi (io-13), with an amino-terniinaJ segment derived 
from a portion of the exons of bcr on chromosome 22 and a 
carboxyMerminaJ segment derived from a major portion of 
the exons of the c-abl gene on chromosome 9, The chimeric 
structure of bcr-abl and the resulting P210 c * abl is similar to the 
structure of the Abelson murine leukemia virus gag-dbl 
genome and resulting P160 v " abl transforming gene product. 
Both proteins have very similar tyrosine kinase activities (10, 
11, 14) which can be distinguished by their relative stability 
to denaturing detergents and by their ATP requirements from 
the recently described tyrosine kinase activity of the Q-abl 
gene product (15). 
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In concert with structural modification of the amino- 
terminal portion of the ai?/gene, increased level of expression 
has been implicated in activation of c-abl oncogenic poten- 
tial. Myeloid and erythroid cell lines and clinical samples 
derived from acute-phase CML patients contain about 10- 
fold higher levels of the 8-kb bcr-abl mRNA and P210 c - abl than 
the c-abl mRNA forms (6 and 7 kb) and P145 ctbl gene product 
(5, 8, 9, li). The higher level of expression of the chimeric 
bcr-abl message in acute-phase cells is not likely to be solely 
due to the presence of the bcr promoter sequences at the 5' 
end of the gene, since the normal 4.5-kb and 6,7-kb bcr- 
encoded mRNA species are expressed at an even lower level 
than the normal c-abl messages (5, 6). 

We have analyzed a series. of Epstein-Barr virus-immor- 
talized B-lymphoid cell lines derived from CML patients (16), 
With such in vitro clonal cell lines, we can evaluate whether 
the presence of Ph 1 always results in synthesis of the chimeric 
bcr-abl message and protein, and whether the quantitative 
expression varies for cells of B-lymphoid lineage as com- 
pared to previously examined mveloid celJ. lines. Our results 
show that cell lines that retain Ph 1 do express bcr-abt message 
and protein, but that the level is generally lower and more 
variable than previously seen for myeloid cell lines. The 
demonstration that the Ph 1 chromosomal template can vary 
in its level of expression of P210 c * Bbl suggests that secondary 
mechanisms, beyond the translocation itself, contribute to 
the regulation of the bcr-qbl gene in different cell types or 
subclones that derive from the affected stem cell. 

MATERIALS AND METHODS 

Cells and Cell Labelings. Epstein-Barr virus-transformed 
B-lymphoid cell lines were established from peripheral blood 
samples of chronic- and acute-phase CML patients as report- 
ed (16). The cell lines are designated according to patient 
number, karyotype, and lineage;. For example, SK- 
CML7Bt(9,22)-33 refers to CML patient 7, B-lymphoid cell 
line, 9;22 translocation (Ph 1 ), cell line 33; and SK-CML7BN- 
2 refers to B-ceH line 2 with a normal karyotype derived from 
the same patient. Repeat karyotype analysis was performed 
to verify the retention of Ph 1 just prior to analysis, for abl 
protein and RNA. Cells were maintained in RPMI 1640 
medium with 20% fetal bovine serum. We have not observed 
any consistent pattern of in vitro growth rate that correlates 
to the stage of disease at the time of transformation with 
Epstein-Barr virus. Cells (1.5 x 10 7 ) were washed twice with 
Dulbccco's modified Ehgle's medium lacking phosphate and 

Abbreviations; bcr\ breakpoint-cluster region; CML, chronic 
myelogenous leukemia; kb, kilobasc(s). 

^Present address: Department of Genetics, University of Washing- 
ton, Seattle, WA 98195. 

*To whom correspondence should be addressed. 
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Table 1. Relative levels of bcr-abl expression in Epstein-Barr 
vtrus-immortaliied B-cell lines and mye loid CML lines 



Cell line* 


CML phase i 


Ph 1 * 


P210* 


mRNA* 


SK-CML7BN-2 


BC 








SK-CML8BN-10 


Chronic 








SK-CML8BN-12 


Chronic 








SK-CML16BN-1 


Chronic 








SK-CML35BN-1 


Chronic 








SK-CML7B5-33 


BC 


+ 


+ + + 


+ + + 


SK-CML21BI-1 


Acc 


+ 


+ + + 


+ + + 


SK-CML21BI-6 


Acc 


+ 


+ + + 


+ + + 


SK-CMLSBt-3 


Chronic 


+ 






SK-CML16BM 


Chronic 


+ 


+ 


+ 


SK-CML35Bt-2 


Chronic 


+ 


+ 


+ 


K562 


BC 


+ 


+ + + + + 


+ + + + + 


BV173 


BC 


+ 


+ + + + + 


+ + + + + 


EM2 


BC 


•t 


■+ + + + 4 


+ + + + + 



♦Cell lines derived from CML patients by transformation with 
Epstein-Barr virus as described (16). Names of cell lines indicate 
patient number and Ph l status: SK-CML7BI indicates a cell line 
derived from patient 7 that carries the 9;22 Ph 1 translocation; N 
indicates a norma] karyotype. Myeloid-erythroid cell lines (K562, 
EM2, and BV173) are described in previous publications (9, 11, 22, 
33). 

^Status of patient at the time cell line was derived. BC, blast ensis; 
Acc, accelerated phase. 

^Presence <+) or absence (-') of Ph 1 as demonstrated by karyotypic 
or Southern blot analysis. 

f 1*210"*" detected as described in legend to Fig. 1. B-ccll tines 
derived from blast-crisis and accelerated-phase patients had levels 
of P210 3- to 5-fold higher (+ + +) than levels of P145. Chronic* 
phase-derived cell lines had P210 levels lower than orjust equivalent 
(+) to the level of P145. Myeloid and erythroid lines had levels of 
P210 5- to 10-fold higher than P145 (+ + + + +). 

^Eight'kilobase bcr-abl mRNA detected as described in legend to 
Fig. 2. Symbols: :t, borderline detectable; + + + 4 + , level of 8-kb 
mRNA 5- to 10-fold higher than that of the 6- and 7-kb c-abf mRNA 
species; + 4 + , level of 8-kb mRNA 3* to 5-fold higher than that of 
the 6- and 7-kb species; + , a level approximately equivalent to that 
of the 6- and 7-kb messages. 

data not shown). There was no difference in the copy number 
of fl6/-related sequences as judged by Southern blot analysis 
(Fig. 4).* Only the K562 cell line control showed an amplifi- 
cation of abl sequences, as previously reported (22 , 23). 
These combined data suggest that differential bcr-ubl mRNA 
expression from a single gene template is responsible for the 
variable levels of P210 <> ** bl detected. This could be mediated 



— P210 



— P145 



Fig. 2. Analysis of steady-state abl protein levels by unmuno- 
blotting. Cell extracts prepared from 2 x 10 7 cells of lines SK- 
CML7BN-2 (A,-), SK-CML7BI-33 (A,+), SK-CML8BN-10 (/?,-), 
and SK-CML8Bt-3 + ) were, concentrated by immunoprecip* 
itation with anti-pEX-2 plus anti-pEX-5. Samples were then clectro- 
phorescd in a NaDodS(V8% pol vac ryi amide gel and transferred to 
nitrocellulose, using protease-facilitated transfer CI 8). abl proteins 
were detected using a mixture of two monoclonal antibodies directed 
against the pEX-2 and pEX-5 afc/-pro(ein fragments produced in 
bacteria (19) as a probe and a pcroxidase-conjugated goal anti-mouse 
second-stage antibody (Bio-Rad) for development. 





Fig. 3. Comparison of abl RNA levels in Ph l -positive and 
-negative B-cell lines. The levels of the normal 6- and 7-kb z-abl 
RNAs and the 8-kb bcr-abl RNA were analyzed by blot hybridization 
using a v~abl probe. RNA was extracted from Ph^negative lines 
SK-CML7BN-2 (A,~) and SK-CMLHBN-1 (£,-). from Ph'-pos- 
itive lines SK-CMUBt-33 (A,+) and SK-CML16BI-3 (J3, + ), and 
from line K562 (C, + ) by the NaDodS0 4 /urea/phenol method (20). 
Polyadenylylated RNA was purified by oligo(dT) affinity chroma- 
tography, and 15 pg of each sample was electrophoresed in a 1% 
agarose/forrnaldehyde gel and then transferred to nitrocellulose. The 
blotted RNAs were hybridized with a nick^translated \-abl fragment 
probe (23) and then auloradiographcd for 4 days. 



by factors influencing the transcription rate of the bcr-abl 
gene or the stability of the mRNA. 

DISCUSSION 

Several lines of evidence suggest that formation of Ph 1 is not 
the primary event that affects the stem cell in CML. Patients 
have been identified that present with the clinical picture of 
CML but only later develop Ph 1 (1). This observation, 
coupled with studies of G6PD (glucose-6-phosphate dehy- 
drogenase)-heterozygous females with CML that demon- 
strate stem-cell clonality by isozyme analysis among cell 

A 12 3 4 5 6 7 8 9 10 11 

\ Wk — 240 




Fig. 4. Southern blot analysis of abl sequences in Ph l -positive 
and -negative B*cell lines. High molecular weight DNA (15 /ig) was 
digested with restriction endonuclease BamHI, separated in a 0.8% 
agarose gel, and then transferred to nitrocellulose. The blotted DNA 
fragments were hybridized with a nick-translated, 2.4-kb B$i 11 v-ofc/ 
fragment (1.5 x'lO 1 cpm/jug; ref. 21) and exposed for 4 days. (A) 
Auto radiogram of afcf-specific fragments in cell tines HL-60 (lane 1), 
EM 2 (lane 2), K562 (lane 3), SK-CML7BI-33 Oane 4), SK-CML8Bt-3 
(lane 5), SK-CML16BM (lane 6), SK-CML21Bt-6 Oane 7), SK- 
CMU5Bt-2 (lane 8), SK-CML7BN-2 (lane 9), SK-CML8BN-2 (lane 
10), and SK-CML35BN-1 (lane 11). (B) Ethidium bromide staining of 
agarose gel prior to transfer to nitrocellulose, showing the level of 
variation in amount of DNA loaded per lane. _ 
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populations that lack the Ph 1 marker, supports a secondary 
or complementary role for Ph 1 in the progression of the 
disease (24, 25). This chromosome marker is found in 
chronic, accelerated, and blast -crisis phases of the disease. It 
is likely' that Ph 1 confers some growth advantage, since cells 
with the marker chromosome eventually predominate the 
marrow and peripheral blood even in chronic phase. During 
the phase of blast crisis, many patients develop additional 
chromosome abnormalities, including duplication of Ph 1 , a 
variety of trisomies, and complex translocations (26). This 
is suggestive evidence for Ph 1 being a necessary but not 
sufficient genetic change for the full evolution of the 
disease. 

The realization that one molecular result of Ph 1 is the 
generation of a chimeric bcr-abl protein with functional 
characteristics and structure analogous to the gag-abl trans- 
forming protein of the Abelson murine leukemia virus 
strengthens the argument for an important role of Ph 1 in the 
pathogenesis of CML. Although the Abelson virus is gener- 
ally considered a rapidly transforming retrovirus, its effects 
can range from overcoming growth factor requirements, to 
cellular lethality, to induction of highly oncogenic tumors in 
a number of hematopoietic cell lineages (27, 28). Even in the 
transformation of murine cell targets, there are several lines 
of evidence that suggest that the growth-promoting activity of 
the v-ab/ gene product is complemented by further cellular 
changes in the production of the malignant-cell phenotype 
(29-31). 

The regulation of bcr-abl gene ( expression is. complex 
because the 5' end of the gene is derived from the non-afc/ 
sequences, bcr, normally found on chromosome 22 (6). The 
level of stable message for the normal bcr gene and the 
normal abl gene are both much lower than the level of the 
bcr-abl message and protein from cell lines and clinical 
specimens derived from myeloid blast-crisis patients (5, 6, 
11). Therefore, the high level of bcr-abl expression cannot 
simply be attributed to the regulatory sequences associated 
with bcr. Possibly, creation of the chimeric gene disrupts the 
normal regulatory sequences and results in a higher level of 
expression. Variation in bcr-abl expression may result from 
secondary changes in the structure of the chimeric gene or 
function of fro/u-acting factors that occur during evolution of 
the disease. Our analysis of P210 c -* bl and the 8-kb mRNA in 
Epstein-Barr virus-transformed Prepositive B-cell lines 
demonstrates that stable message and protein levels from the 
bcr-abl gene can vary over a wide range. This variation does 
not result from a change in the number of bcr-abl templates 
secondary to gene amplification but more likely from changes 
in either transcription rate or mRNA stability. We suspect 
this range of bcr-abl expression is not limited to lymphoid 
cells. Analysis of peripheral blood leukocytes derived from 
an unusual CML patient who has been in chronic phase with 
myeloid predominance for 16 years showed a level of 
P210 c * aM one-fifth that of P145 c - abl , as detected by metabolic 
labeling with ["Pjorthophosphate and immunoprecipitation 
(S.C., O.N.W., and P. Greenberg, unpublished observa- 
tions). Lower levels of expression of the chimeric mRNA 
have been demonstrated in clinical samples from chronic- 
phase CML patients compared to acute-phase CML patients 
(9). Others have reported chronic-phase patients with vari- 
able but, in some cases, relatively high levels of the bcr-abl . 
mRNA (32). The sampling variation and the heterogenous 
mixture of cell types in clinical samples complicate such 
analyses, Further work is needed to evaluate whether there 
is a defined change in P230 c ' ttbl expression during the pro- 
gression of CML. It is interesting to note that among the 
limited sample of Ph^positive B-celi lines we have examined 
(Table 1), we have seen higher levels of P210 c '* bl in those 
derived from patic v at more advanced stages of the disease. 
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It will be important to search for cell-type-specific mecha- 
nisms that might regulate expression of bcr-abl from Ph 1 . 
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Sir: 

I, Audrey D. Goddard, Ph.D. do hereby declare and say as follows: 

1 . I am a Senior Clinical Scientist at the Experimental Medicine/BioOncology, Medical 
Affairs Department of Genentech, Inc., South San Francisco, California 94080. 

2. Between 1 993 and 200 1 , 1 headed the DNA Sequencing Laboratory at the Molecular 
Biology Department of Genentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic process, and determination 
of the chromosomal localization of novel genes. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to and 
forms part of this Declaration (Exhibit A). 
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4. I am familiar with a variety of techniques known in the art for detecting and 
quantifying the amplification of oncogenes in cancer, including the quantitative TaqMan PCR (i.e., 
"gene amplification") assay described in the above captioned patent application. 

5. The TaqMan PCR assay is described, for example, in the following scientific 
publications: Higuchi et aL, Biotechnology 10:413-417 (1992) (Exhibit B); Livak et al, PCR 
Methods AppL 4:357-362 (1995) (Exhibit C> and Heid et aL, Genome Res. 6:986-994 (1996) 
(Exhibit D). Briefly, the assay is based on the principle that successful PCR yields a fluorescent 
signal due to Taq DNA polymerase-mediated exonuclease digestion of a fluorescently labeled 
oligonucleotide that is homologous to a sequence between two PCR primers. The extent of 
digestion depends directly on the amount of PCR, and can be quantified accurately by measuring the 
increment in fluorescence that results from decreased energy transfer. This is an extremely sensitive 
technique, which allows detection in the exponential phase of the PCR reaction and, as a result, 
leads to accurate determination of gene copy number. 

6. The quantitative fluorescent TaqMan PCR assay has been extensively and 
successfully used to characterize genes involved in cancer development and progression. 
Amplification of protooncogenes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic significance. This use of the quantitative 
TaqMan PCR assay is exemplified by the following scientific publications: Pennica et aL, Proc. 
Natl Acad. Sci. USA . 95(25): 147 17- 14722 (1998) (Exhibit E); Pitti et aL, Nature 
396(6712);699-703 (1998) (Exhibit F) andBieche etaL. Int. J. Cancer 78:661-666 (1998) (Exhibit 
G), the first two of which I am co-author. In particular, Pennica et aL have used the quantitative 
TaqMan PCR assay to study relative gene amplification of WISP and c-myc in various cell lines, 
colorectal tumors and normal mucosa. Pitti et aL studied the genomic amplification of a decoy 
receptor for Fas ligand in lung and colon cancer, using the quantitative TaqMan PCR assay. Bieche 
et aL used the assay to study gene amplification in breast cancer. 
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7. It is my personal experience that the quantitative TaqMan PCR technique is 
technically sensitive enough to detect at least a 2-fold increase in gene copy number relative to 



useful in that the detected increase in gene copy number in the tumor sample relative to the normal 
sample serves as a basis for using relative gene copy number as quantitated by the TaqMan PCR 
technique as a diagnostic marker for the presence or absence of tumor in a tissue sample of unknown - 
pathology. Accordingly, a gene identified as being amplified at least 2-fold by the quantitative 
TaqMan PCR assay in a tumor sample relative to a normal sample is useful as a marker for the 
diagnosis of cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. 

8. I declare further that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true. I declare that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Cade, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereon. 



control. It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number in a tumor tissue sample relative to a normal (i.e., non-tumor) sample is significant and 
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Slave enhanced the polymerase chain 
reaction* (FCR) such ttfaat specific DNA 
sequences cam be detected wMtomt open- 
ing tbe reaction- telbe* Tfcis eBlb^ceroeM 
requires fthe addition off eHMdiuffl- bjronaide 
(EtBr) to a PCSL Since ftlhe fluorescence of 
EfcBjr increases" in tfhe presence of qionble* 
stranded (ds) DNA mcrease am ffltaores* 
ce»ce in such a PGR indicates a positive 
" wlbidh casa be esa&ily sns«mi« 



am; 



tored estfemally. In fact, amplification caus 
be continuously -mOnitoired in order to 
follow its pragiress, The ability to smjumJhta^ 
meoaislly -amplify specific BNA sequences 
and detect tfce product of Ulne amplification 
foolb simplifies aznd improves PGR and 
may fsiclEitale its automation em»d itiaore 
widespread use in the cliMc osr in otthear 
s&tuafoons requirijig higSa sample 
put 



A 



lthough the potential bicncfi.Q; of PGR 1 to cliu- 
t^al d&gnostics art: weU kttowi* 2,5 , it is still not 
widely used in this setting, even though b » 
four years ciuco thcrnwFtabl^ DMA pofym^f* 
ase$ 4 TPjidc PCR pxacticid. Some of the reasons for it* slow, 
acceptance are high cost, tack of automation of pre- and 
post- PCR processing steps, and false positive results, from 
orryovcT-contanunation. The first two points arc related 
in that labor is the largest contributor to cost flit the present 
stage of PCR development. Most current assays require 
some form of "downstream" processing once thermocy* 
cling is done in order io determine whether the target 
EN A sequence was present and has amplified. These 
include DNA hybridiwtiorr 5 *, gel ekctrophoresis with or 
without use of restriction digestion*;*, HPLCr, or capillary 
electrophoresis 10 . These methods are labor -intense, bare, 
low throughput, and arc difficult to automate. The third 
point is also ck^cty related to downstream processing. 
The handling of the PCR product in these downstream 
processes increases the chances that amplified DNA* will 
spread through the typing lab, resulting in a risk of 



carryover" false positives in subsequent testing". 

These downstream processing steps would be elimi- 
nated if specific amplification and detection of amplified 
DNA took place simultaneously within an unopentd ire- 
action vessel Assays m which such different processes take 
place without , the need to separate reaction components 
have been termed \Tjomogeneous*\ .No truly homoge- 
neous PCR assay has been demonstrated to date, although 
progress towards this end has been reported. Chehab, et 
a]. 1 , developed a PCS. product detection scheme using 
fluorescent primers that resulted in a fluorescent PCR 
product Allctc-specific primers, each with different fluo- 
rescent tags, were used to indicate the genotype of the 
DNA. However, the unincorporated primers must still be 
removed in a do wnstream process in order to visualize tthe 
result Recently, Holland, et al> 15 , developed an assay in 
which the endogenous 5' exonuclease assay of Taq DNA 
polymerase was exploited to cleave a labeled oligonucleo- 
tide probe. The probe would only deave if PCR aznp&fr- 
cation had produced its complementary sequence. lt» 
order to detect the dcavage products, however, a subse- 
quent process m 'again needed. 

We have developed a truly homogeneous assay for PCR 
and PCR product detection based upon the gready in- 
creased fluorescence that ethidium 'bromide and other 
DNA binding dyes exhibit when they aie bound to ds- 
DNA 14_ie . As outlined in Figure J, a prototype PCR 
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I Principle of simultaneous amplification and- detection of 
PCR product The components of a PCR coot3rinhn« EiBr mat arc 
fluorescent are hswd—ttBr itself, EtBr bound to other ssDNA ot 
daDN A. There is a lar^c fiiwresccncc enhancement when EtBr is 
bound to PNA and binding is gready enhanced when DNA .is 
douhk-scranded. After sufficient <n).. cycles of PCR* the .net 
incrca.se in dsDNA retuks in additional £tBr binding, and * net | 
incrcmie in total Ruorcsccncc: 1 
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product 
dtoier 



KS^aGelelcctroprwmbof 1^ amputation products of the 
humat), nuclear gene, HLA DQfc, made in the presence of 
increasing amounts of EtBr (up to 8 ^g/ml)* The presence of 
EtJJr Jjax no obvious effect on ihc yield or specificity of amplifi- 
cation. 



A, 




B. 




MOSSES $ (A) Fluorescence measurement* from PCRs that contain 
D.5 pgftn! EtBr and that arc specific for Y-chromoiOjroc repeat 
5eqocnces. Five replicate PCRj^erc begun containing each of the 
DNA* specified. At each indicucd cyde, One of the five replicate 
PCRs for each DNA "was removed from therm ocy ding and Hs 
fluorescence measured, Unit* of fluorescence Art arbitrary. {B> 
UV photography of PGR tube* (0.5 ml Eppcndor£*tylc, pc4yprC~ 
pykne mtciv<«r»irifugc tubes) contenting reactions, those start* 
ing from 2 ng male DNA and control reactions without any DNA, 
from (A), 



begins with primers that are single-stranded DNA (ss- 
DNA), dNTPs. and DNA polymerase^ An' amount off 
da DNA containing the target sequence (target DNA) is 
also typically present. This amount can vary, depending 
on the application, from single-cell amounts of DNA IT to 
micrograms per PCK^ 8 , If EtBr is present the reagents 
that will fluoresce, in order of increasing fluorescence, are 
free EtBr itself, and EtBr bound to the single-stranded 
DNA primer* and to the double-stranded target DNA (by 
its intercalation between the stacked bases of the DNA 
doubkr-hefat). After the first denatu ration cyde, target 
DNA will be largely sinric-stranded. After a PGR is 
completed, tihe most significant change is the increase in 
the amount of dsDNA (tbe PCR product itself) of up to 
several tnicrbgrams. Formerly free EtBr is bound to the 
additional dsDNA* resulting in an increase in fluores- 
cence. There i* also some decrease in the amount of 
ssDNA primer, but becauM; the binding of EtBr to s*DN A 
is much less than to dsDNA, the effect of this change on 
the total fluorescence of the sample is small* The fluores- 
cence increase can be measured by directing excitation 
illumination through the walls of the amplification 1 
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before and after, or even continuously during, thertnocv 
ding. r 

RESULTS 

PCR in the presence of EtBr. In order to' assess the 
affect of EtBr in FOR, amplifications of the human HLA 
DQa gene >9 were performed with the dye present at 
concentrations from 0,06 to 8.0 u,g/rnJ (a typical concen- 
tration of EtBr used in staining of nuctek acids foltewing 
get electrophoresis is 0.5 u-g/mf). As shown in Figure 2, gel 
electro^borcsis revealed Jktle or no difference in the yield 
or quality of the amplification product whether EtBr was 
absent or present at any of these concentrations, indicat- 
ing that EtBr does not mhibit PCR, 

Defection of huraan Y-chtozacmsira specific se* 
qneraces. Sequence-specific, fluorescence enhancement of 
EtBr as a result of PCR was demonstrated in a scries of 
amplifications containing 0,5 u-g/ml EtBr and primers 
specific to repeat DNA sequences found on the humaa 
Y<hromosomc*°- These PCRs initially contained cither 
60 ng male. 60 ng female, 2 ng roak human or no DNA. 
Five replkate PCRs were begun for each DNA. After 9, 
1 7, 23 , 24 and 29 cycles of thenuocyding, a PGR for each 
DNA was removed from the theirooeyder, and its fluo- 
rescence measured in a spectrofinoro meter and plotted 
vs. amplification cycle number (Fig. 3A). The shape of this 
curve tcSccts the Fact that by the time an increase in 
fluorescence can be detected, the increase in DNA is 
becoming linear and not exponential with cyde number. 
As shown, the fluorescence increased about three-fold 
over the background fluorescence for the PCRs contain- 
ing human male DNA, but did not sigiiincandy increase 
for negative control PCRs, which contained cither no 
DNA or human female DNA. The more male DNA 
present to begin with — 60 ng versus 2 ng— the fewer 
cycles were needed to give a detectable increase in fluo- 
rescence. Gel e^ecti^phoresis w the products of these 
amplifications showed that DNA fragments of the ex- 
pected size were made in the male DNA containing 
reactions and that Utile DN A synthesis took place in the 
control samples. 

In addition, the increase irj. fluorescence waft visualized 
by simply laying the completed* unopened PCRs on a UV 
transilrurninator and photographing them through a red 
filter. This is shown in figure SB lor the reactions that 
began with 2 ng male DNA and those with no DNA* 

Detection of specific alleles of the human $-globm 
gerae. in order to demonstrate that this approach has 
adequate spedfidty to allow genetic screening* a detection 
of the sicWc-ccll anemia mutation was performed- Figure 
4 shows the fluorescence from completed amplification* 

containing EtBr (O.S ng/ml) a* detected fc*y photography 

of the reaction tubes on a UV a-ansiUummator, These 
reactions were pet formed using- primer* specific for ci- 
ther the wild-tvpe or sickk-cell mutation of the human 
p-globin gene* \ The specifidty for each allele h imparted 
by placing the sickle-mutation site at the terminal 3' 
nuefeocidc of one primer. By using an appropriate primer 
annealing temperature, primer extension— and thus am- 
plication — can take place only if the 3' nucleotide of the 
primer is cornnlcmcntary to the 0-globin allele prt*cnt* ,,S2 . 

Each pair 61 ampliations shown in Figure 4 consists of 
a reaction with either the wiktoypc allele specific (left 
tube) or skkle-allele specific (right tube) primers. Three 
different DN As were typed: DNA from a homozygous, 
wild-type p-globin individual (AA); from a heterozygous 
sickle p^ipbin individual (AS); and from a homozygous 
sickle p-glc*b?n individual (^Jffe^NA (50 ng genomic 
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f reactions each). The DNA .type vas reflected in the 
rtJative fluorescence intensities in each pair of completed 
^plificatioitt. There was a significant increase in fluores* 
Jcnce only where a {^globin allele DNA matched the 
^jper set. When measured - on. a spcctroflnoroinctcr 
Matfi not shown), this fluorescence was about three times 
j^t present in a PCR where both p-globhi alleles were 
jnbitiatchcd to the primer set. Gel electrophoresis (not 
shown) established that this increase in fluorescence was 
due to the synthesis of nearly a microgram of a DNA 
fragment of the expected size for p-globin. There was 
little synthesis of dsONA in reactions in. which the allele* 
jjpecinc primer was mismatched to both alleles* 

Continuous momtowvag of a PCR* Using a fiber optic 
devkerit is possible to direct excitation illumination from 
,i spectrofluorometer to a PGR undergoing thcrmocycling 
and to return its fluorescence to the Rpectrofttiororueter. 
The fluorescence readout of such an arrangement, di- 
rected at an EtBr-containing amplification of Y-chromo- 
some specific sequences from 25 r*g of human male DNA* 
is shown in Figure 5. The readout from a control 1*CR 
v|Ui no target DNA is also shown. Thirty cycles of FCR 
were monitored for each- 

The fluorescence trace ax a function of time dearly 
shows the effect of the thermocyding. Fluorescence inten- 
sity rises- and . foils utYCrsdy with temperature* The fluo- 
rescence intend ty is minimum at tbe denatuxadon tern- 
perature (94°C) and maximum at the anneatog/exteiision 
temperature (5<TC). In the negative-control FCR, these 
fluorescence maxima and minima do not change signifi- 
cantly over the thirty tbcrraocycte, indicating that there is 
tittle dsDNA synthesis without the appropriate target 
DNA, and there is little if any bleaching of EtBr during 
the continuous illumination of the sample. 

Jn the PGR containing male DNA, the fluorescence 
maxima at the annealing/extension temperature begin to 
increase at about 4000 seconds of thcrenocycling, and 
continue to increase with time, indicating that dsDNA is 
being produced at a detectable level. Note that the fluo- 
rescence minima at the denaturatton temperature do not 
significantly increase, presumably because al this temper- 
ature there is no dsDNA for EtBr to bind. Thus the course 
of the amplification is followed by tracking the fluores*. 
cence increase at the annealing temperature. Analysis of 
the products of these two amplifications by gel electro pho- 
nal showed a DNA fragment of the expected size for the 
male DNA containing sample and no detectable DNA 
synthesis for the control sampte. 

DISCUSSION 

Downstream processes such as hybridization to a se- 
qucnce-Apedfic probe can enhance die specificity of DMA 
deceuuvu ty PGR* The climu>atk>n of tiffin processes- 
means that the specificity of this homogeneous assay 
depends solely on ihat of FCR. In the case of fickle-cell 
disease, we have shown that PGR alone has sufficient DNA 
Hcqucnee specificity to permit generic screening- Using 
appropriate amplification conditions, there is little non- 
specific production of dsDNA in the absc&ce of the 
appropriate target allele. 

The spedfidty required to detect pathogens can be 
more or less than that required' to do genetic screening, 
depending on the number of pathogens in the samnle and 
the amount of other DNA that must be taken with the 
sample. A difficult target is HIV, which requires detection 
of a viral genome that can be at die level of a few copies 
per thousands of host cells*. Compared whh genetic 
screening, which is performed on cells containing at least 
one copy of die target sequence, HIV idetection requires 
both more specificity and the inpxrt of more total 
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k UV photography of PCK tubes containine Junjpt/fic^Uons 
i«fing EtBr that art specific to wild-type (A) or liefoe (5) aJldat of 
the human £-globin gene. The left ot each pair of tubes contains 
aBcl«»spccific primer* to the wild-type alleles, the right lube 
pruaefs to the sicWe aHek. The phwograph taken after 30 
cycles of PGR, and the input DNAs and the alleles ihev coataia 
are indicated- Fifty tog of DNA was used to begin PGR. Typing 
was done in triplicate (3 pair* of PCRs) for each input DNA: 
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WGGEaE S Continuous, real-time monitoring of a FCR. A fiber optic 
was twed to cany excitation light to a *CR in progress and also 
emitted light back to a floorometcr (bcc FxpenWicntal PrtWocc]). 
AmplificadoD U-MOg human malcDNA specific primers in a PGR 
starting with 20 ng of human male DNA {tooJ T or in a control 
PCH without DNA (bottom), were, roonnorro". Thmv cydes of 
PGR were followed for each. The temperature cycled between 
94*C (denaturation) and 50*C (anmaliog and extension). Note in 
the male DNA PGR,, the cycle (time) dependent increase in 
fluorescence at the anneaEag/exteDsioB temperature. 
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DMA-— itp to microgram amounts— in order to have suf- 
ficient numbers of target sequences. This large amount of 
starting DNA in an ampttfcadoa signitjounUy increases 
the background fluoracence over which any additional 
fluorescence pioduced by VCTL must be detected. An 
additional complication that occurs wiih target* in tow 
copy-number is the formation of the ^rimer-cfimer" 
artifact. This is the rcswk of the extension of one primer 
using the other primer as a template. Although this occurs 
infrequently, once H occurs the extension product is a 
substrate for PGR amplification, and can compete with 
true PGR targets if those targets are rare. The primer- 
dimer product i$ of course dsDNA and thus is a potential 
source of false signal in this homogeneous assay. 

To increase PGR specificity anc reduce the eHeet of 
primer-dimer anipliheation, we are investigating a num- 
ber of approaches, including the use of nested-primer 
amplification* that take place in a singte tube 3 , and the 
4 liot-start*' ( in which nonspecific amplification is reduced 
by raising the temperature of the reaction before DNA 
synthesis begins 23 . Preliminary resuks usong these ap- 
proaches suggest tbatprjOTcr-dimcT is effectively reduced 
and it is possible to detect the increase in Eifir fluores- 
cence in a PGR instigated by a single HIV genome in a 
background of 10 s celts. With larger numbers of cells, the 
background fluorescence contributed by genomic DNA 
becomes problematic. To reduce this bacScgrouod, it may 
be possible to use sequence -sped fie DNA-binding dyes 
that can be made to preferentially bind PCR product over 
genomk DNA by incorporating the dye-binding DNA 
sequence into the PCR product through a £' "add-on" to 
the oligonucleotide primer*' 1 . 

We nave shown that the detection of fluorescence 
generated by an EtBr-containing PCR is straightforward, 
both once PGR is completed aod cominuousDy during 
(J^ermocyding. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
promising aspect of this assay. The fluorescence analysis 
of completed PCRs is already Dosstblc with existing instru- 
mentation in 9G-weJI format* . In this format, the fluores- 
cence in each PCR can be cjuantitated before, after, and 
even at selected points during therroocyciing by moving 
the rack of PCRs to a ^microwcJI plate fluorescence 
reader* 6 . 

The instrumentation necessary to continuously monitor 
multiple PCRs simultaneously is also simple in principle. 
A direct extension of the apparatus used here i$ to have 
multiple fiberoprics transmit the excitation light and flu- 
orescent emissions to and from multiple PCRs. The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target DNA copy number. Figure 3 shows that 
the larger the amount of starting target DNA, the sooner 
during PCR a fluorescence increase is detected. Prelimi- 
nary experiments <Higychi and Dollinger, manuscript in 
preparation) with continuous monitoring have shown a 
sensitivity to twofold differences in initial target DNA 
concentration. 

Conversely, if the number of target molecules is 
known — as it can be in genetic screening— rcontinuous 
monitoring may provide a means pf detecting faBsc posi- 
tive and false ncgatTvc results. With a known number of 
target molecules, a true positive would exhibit detectable 
fluorescence by a predictable number of cycles of PCR. 
Increases in fluorescence detected before or after that 
cycle would indicate potential artifacts* False negative 
resuks due to t for example,. inhibition of DNA polymer' 
ase, may be detected by including within each PCR an 
inefficiently amplifying marker. This marker results in a 
fluorescence increase only after a large number of cy- 
cles — many more than are necessary to detect a true 
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positive. If a sample fails to have a fluorescence increase 
alter this many cycles, inhibition may be suspected. Since, 
in this assay, conclusions are drawn based on the presence 
or absence of fluorescence signal alone, such controls may 
be important. In any event, before any test based on this 
principle is ready for the clink, an assessment of its false 
positive/false negative rates will need to be obtained using 
a large number of known samples. 

In summary, the inclusion in PCR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes it 
possible to detect spcci&c DNA amplification from outside 
the PCR tube. In the future, instruments based upon this 
principle may facilitate the more widespread use of PCR 
in applications that demand the high throughput of 
samples^ 

EXmiTMENTAL PROTOCOL 

H'vunam HLA-&Q<* gene dmpKSc&iions containing Et&r. 
PCiRs were set up in 100 j*t volumes confining 10 mM Tris-HCK 
pH 8.3; 50 mM RC1; 4 mM MgO s : S-5 units of too DNA 
polymerase (Perltm*£}frtcr Cctvw, Norwalk, CT); 20 piriole each 
of human HlA-DQa gene specific oligonucleotide primers 
GH26 and CH27 19 aud approximately W copies of DQfr FCK 
product diluted from a previous reaction. Ethfctium bromide 
(Eibr, Sigtw*} was used M the concentrations indicated id Figure 
2- Thcnnocyding proceeded for 20 cytlei in a model 4*0 
thcrmocydcr (PerkjivElmer Ccw, Norwalk, CT) using a "step- 
cycle" program of 94*C for 1 mm. dcnaturalion and 6(rC for 40 
sec annealing and 72°C for 30 sec. extension* 

Y-chromos>oxDc specific PCSL PCRs (J 00 uj total reaction 
volume) conCfubingO^ ftc/id EtBr were prepared as described 
For HLA-DQo, except with different printers and target DNAs. 
These PCRs contained J 5 pmote each male DNA**pccific primes 
YI. 1 and V 1.2*°, abd cither 60 ng male, €0 eg female, 2 ng male, 
or no human DNA. Thcrmocycling was- 94°C Tor 1 min- and SO^C 
for 1 min using a "step-cycle* program. The number of cycles for 
a sample were as indicated in ffgur* 3. Fluorescence measure^ 
merit is described below. 

AUcJc-ap-ccsncy human fJ-glofrin p« PCR, Amplifications of 
100 fU volume v&ng 0-5 jigAnl of £iBr were prepared a* 
described for HLA^DQoi above except -with different primers and 
target DNAs. These PCRs contained either, primer pair 
Hp HA {wiW-type globin specific primers) or HGP2/HpMS {skk- 
-lc-giobin specific primers) at 10 pmoJe each piimcr per PCR. 
These primers were developed by Wu ct aL 21 . Three dttferent 
tatgei DNA a were used in separate amplifications— 60 ng cacb of 
human DNA that was homozygous for (he sickle trait (S5) t DMA 
that was fatcrozygous for the sickle trak (AS), or DNA that was 
homozygous for the W.t. ^Jobio (AA). Thcrmocycfing wa* for ^0 
cycles at 94T for 1 mm. and for 1 min. ttswe a M stej>-cyckf*' 
program. Att annealing temperature of 55*^ bad heco shown by 
Wo ei al. 2J to provide allcJc-spccifk awplifiea.tion. Completed 
PCR* were photographed through a red filter (WraUen 23 A) 
after placing the reaction lobes atop a model 7141*36 tranwHuffli- 
nator <lf V-products Sarv Gabriel, CA>. 

nvoreseence measnrcme&t. Fluorwce»>ee racayurcraen^ wcr* 
madtf ori PCRs contalrdng EtBr in a Fluorolog»2 flUoromcter 
(SPE5C Edison, NJ). ExeiCrtion was at the 5Q0 nra band with 
al»ur 2 nm bandwidth with a GG 435 nrn cut-offfilutr jMeJles 
Grist, inc.* Irvine. CA> to exclude second-order light. Emitted 
light was detected at 5^0 nm with a batidwidtl) of about 7 i>m. An 
OG 530 pm cut-off fi)tcr was used to remove the excitation light. 

Coratitatoofi fftooineRcence nioaiteing of PCR, Continuous 
monitoring ot a PCR in progress ^as accomplished using the 
Bpcctrofiuoromeier and settings descrrbod Above as well a* a 
fiberoptic accessory <SF£X cat no. 1950) 10 both send excttauon 
light to, and receive emittetd light from, a PCR placed in a well of 
a model 480 *eTmocydcr (PtrJtm-Elnier Celus). The probe end 
of the fiberoptic cable was attached with K 5 mmutc-cpoxy* to the 
open top of a PCR cube (a 0.5 ml polvpropyicnc centrifuge rube 
with its cap removed) erTeowely scaling it. The exposed top <n 
ih$ PCR tube and the end of the fiberoptic caWc were shielded 
from room light and the rooco ljgbts were kept dimmed during 
• each run. The monitored PCR was an amplication of V-cbro- 
mo$6me-sfK:drk repeat scqoences as described above, except 
using. an anneaHng/extension lemperauirc of 50°C. The Tcaccon 
was covered with xnirveral oil (2 drops) to prwrtt evaporation. 
TbcrTOOcycUng and- fluorcsocncc measurement were started si- 
multaneously. A time-base scan, witft a 10 second intcgratiOiV tone 
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uf<5cI arid the emission signal was ratioed to %bc cxctoticro 
wjriciftJ W control for chwftes in Ji*ht-sourcc intensity, Efctawcrc 
^ccicd using the dm3O0Ot, version 15 (SPEX) data system. 

We ttorik Bob Jonoa for help with r,he sptarofluormeiiric 
(jKVwUi^iiWMit* And HcalherbelJ Fong for editing this manuscript. 
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IMMUNO BIOLOGICAL LABORATORIES 



sCD=1141SJSA 
Trauma, Shock and Sepsis 



The CD-14 molecule is' expressed on the surface of 
monocytes and some macrophages. Membrane- 
bound CD-14 is a receptor for lipopolysaccharide 
(LPS) completed to LPS-Binding-Protein (LBP). "me 
concentration of te soluble form is aftered under 
certain pathological conditions. There- is evidence for 
an important rote of sCD-14.with polytrauma, sepsis, 
buming$ and inflammations. 
During septic conditions and acute infections it seems 
to be a prognostic marker and is therefore of value in 
monftoring these patients. 



1BL offers an ELISA for quantitative determination of 

soluble CD-14 in human serum, -plasma, cell-culture 

supernatants and other biological fluids. 

Assay features: 12x8 determinations 
(microliter strips), 
precoated with a specific 
monoclonal antibody, 
2x1 hour incubation, 
standard range: 3-96 ng/mi 
detection limit: 1 ng/ml 
CV: intra- and inrerassay < 8% 



For more information call or fax. 
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We Suave ejaliaaaced the poJyme^^s dhaiffa 
reaction (PGR) mch that specific BNA 
sequaeiftces cans be detected wMbrat ©jpeiiL- 

reqpnres ttlnc adaJitioo off elMditam-terDTOde 
(EtBr) to a PGR* Siace *&<e flwwesccnce' of 
EttBir ajacineases' in tffee presence of doaible* 
stomdedl (ds) DNA ap mcrease am fltaosies^ 
cesjee in mch a PGR abdicates a positive 
aucQpIafiicaflSoitt,' wtoicli casa be <e&feily urtoBS- 
tored! extteoially* Ib fart, amplification cans, 
foe coettiiitnoBsiy moiEsitoretfl -in order to 
follow its pragress. The aMlity to siroulto" 
inieoimsSy ■ amplify specific DNA sequences 
and detect tibe product of ^Jhe amplificatiora 
botlb sampBifies and improves PCR sumd 
may lEsiciMtoUte its a^atomaitiojra suad. mntoire 
widespjnrad wse w 'the cSiiiaic o^r ins otKaesr 
sit^^aos^s re^uiriilig hig^a sample 
put 



A 



lthough tbc ixrtemial Vjcnc6,u of PCR 1 to cliio- 
tcal diagnostics arc wcH kudwij 2,5 , it U still not 
widely used in this setting, even though it is 
-A. J four year* eiuop thcrowi^a'bl^ P^A poty™*** 

5we« 4 TPfldc PCR pracdcal. Some of the reasons for its slow. 
aocepLantc are high Gost, tack of automation of pre- and 
post-PCR processing steps, and false positive results, from 
carryover-contamination. The Sm two points arc related 
in that labor is the largest contributor to cost flit the present 
stage of PCR development. Most current assays requite 
some form of "downstream" processing once tbermocy* 
ding t£ done in order io determine whether the target 
IttJA sequence was present and has amplified. These 
include DNA hybridiwtion s ** ( gel eJectrophoreAis with or 
without use of restriction digestion*; V H PLC*, or capillary 
electrophoresis 10 . These methods are labor-intense, hare 
low throughput, and arc difficult to automate. The third 
point is abo closely related to downstream processing. 
The handling of the FC& product in these downstream 
processes increases the chances that ainplified DNA . wilt 
spread through the typing lab, resulting in a risk of 



"carryover" false positives in subsequent testing 11 . 

These downstream processing steps would be elimi- 
nated H" specific amplification and detection of amplified 
DNA took place simultaneously within an unopentd re- 
action vessel Assays m whkh such different processes take 
place without , the need to separate reaction components 
have been termed ■Ijomogeneous'V No truly homoge- 
neous PCR assay has been demonstrated, to date, although 
progress towards this end has been reported. Chehab, et 
al. 1 *, developed a FC& product detection scheme using 
fluorescent primers that resulted in a fiuorcficem PCR 
product Allcfic-fipedfic primers* each with different fluo- 
rescent tags, were used to indicate the genotype of the 
DNA. However, the unincorporated primers uiusi still be 
removed in a downstream process in order to visualize (the 
result Recently, Holland, et al, u \ developed an assay in 
which the endogenous 5 r exOnuclease assay of Taq DNA 
polymerase was exploited to cleave a labeled oligonucleo- 
tide probe, The probe would only dcave if PCR axnp£6- 
cation had produced its roroplementary sequence, to 
order to detect the dcavage products, however, a subse- 
quent process is again needed. 

We have developed a truly homogeneous assay for PCJR 
and PCR product detection based upon tbc gready in- 
creased fluorescence that ethidtum broinide arid other 
DNA binding dyes exhibit when they are bound to.ds- 
DNA 14 ~ U . As outlined in Figure h a prototype PCR 



I PCR evet? 0 \ 



frrc F.lUr 



/ 



siDNAp 



(op ID JTOOTJOV) 




tap w «eveid fig) 

1 -Principle of simultaneous amplification and- detection of 
PCR product. The components of a PCR contmntnjg EtBr that arc 
fluorescent are listed— EtBr itself, EtBr bound to other ssDN A wr 
dsDN A. There is a lar^e nawrescencc enhancement when EtBr is 
bound to DNA and bmdin> is gi'c&tly enhanced when DNA .is 
doubt-stranded. After sufficient <n)..cydcs of PGR, the .net 
increase in d.spNA rejiuks in additional £tfir binding, and Z net 
increase in total fluorescence: 
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' fixpected 
product 

.•pflmer- 



reSSJt$i 3 Ge! electrophoresis of PCS. araplinatfon prod wets of ihc 
human, mtdcar gene, HLA DQtt, made in the presence of 
increasing amounts of EtBr (up to 8 N-g/tnl). The presence of 
£tf*r no obvious effect on the ykw or spcdfkfty of amplifi- 
cation. 



A. 




****** 




$ (A) ftootcsccnce measure nocnt^, -from PCRs that contain 
0.5 pgftnl EiBr and that are specific for Y-chrotno$o*iK: repeat 
teqoencc*. Five replicate PCRs were begun containing each of the 
DNA* specified. At each indicated cycle, one of the five replicate 
FCfts for each DNA -was removed from therm ocy ding and tts 
fluorescence measured. Units of fluoresce n<# are arbitrary. {&) 
UV photography of PCR luoef (0,5 ml E$>pcndt>r£*tylc, polyprO* 
pylcrve mtcixx^ntrifuffc tubes) con taming reactions, those start, 
ing from 2 ng male DNA and control reactions without any DNA, 
from (A), 7 



begins with primers that are single-stranded DNA (ss- 
DNA)» dNTPs, and DNA polymerase! An amount of 
dsDNA containing the target sequence (target DNA) is 
also typically present. This amount can vary, depending 
on the application, from single-cell amounts of DNA 17 to 
micrograms per PCR^ 8 , If EtBr is present, the reagent* 
that will fluoresce, in order of increasing fluorescence, are 
free EtBr itself, and EtBr bound to the smgle^tranded 
DNA primers and to the doublc^stnmded target DNA (by 
its intercalation between the stacked bases of the DNA 
doubkr-heffo)* After the first denatu ration cycle, target 
DNA will be largely single-stranded. After a PGR is 
completed, the most significant change is the increase in 
the amount of dsDNA (the PGR product ttsefif) of up to 
several micrograms. Formerly free EtBr is bound to the 
additional dsDNA* resulting in an increase in fluores- 
cence. There is also some decrease in the amount of 
ssDNA primer, but because the binding of EtBr to ssDNA 
is much Jess than to dsDNA, the effect of thb change on 
the total fluore*xricc of the sample is small. The fluores- 
cence increase can be measured by directing excitation 
illumination through the walls of the amplification vessel 
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before and after, or even continuously during, thermocv- 
ding. ' 

RESULTS 

PGR in She presence of EtBY. In order to' assess th? 
affect of EtBr in PGR, amplifications Of the human Hl*A 
DQa gene* 9 were performed with the dye present at 
concentrations from 0.06 to 8.0 u^g/ml (a typical concen- 
tration of EtBr used in staining of nucteic acids foltewing 
get electrophoresis is 0*5 u-g/mF), As shown in Figure 2> gel 
electrophoresis revealed little or no difference in the Yield 
or quahty of the amplification product whether EtBr was 
absent or present at any of these concentrations, indicat- 
ing that EtBr does not inhibit PGR, 

Defection of human Y-chromcso>irn<B specific 
qDoroces* Sequence-specific, fluorescence enhancement of 
EtBr as a result of PGR was demons crated in a series of 
amplifications containing 0,5 jJtgfml EtBr and primers 
specific to repeat DNA sequences found on the human 
Y-chromosomc 5 ^- These PCRs initially contained cither 
60 ng male, 60 ng female, 2 ng mak human or no DNA. 
Five replicate PCRs were begun for each DNA, After 0, 
17,21, 24 and 29 cycles of thenuocyding, a FCR for each 
DNA was removed from the thermocycler, and its. fluo- 
rescence measured in a spectrofinorometex and plotted 
vs. amplification cyde number (Fag. 3A). The shape of this 
curve reflects the fact that by the time an increase in 
fluorescence can be detected, the increase in DNA U 
becoming linear and not exponential with cyde number; 
As shown, the fluorescence increased about, three-fold 
over the background fluorescence for the PCRs contain* 
ing human male DNA, but did not significantly increase 
for negative control PCRs, which contained either no 
DNA or human female DNA. The more male DNA 
present to begin with — 60 ng versus 2 ng— the fewer 
cycles were needed to give a, detectable increase in fluo- 
rescence. Gel ^ecuwfioresi* oo the products of these 
amplifications showed that DNA fragments of the ex- 
pected &xzc were made in the male DNA containing 
reactions and that little DNA synthesis took place in the 
control samples. 

In addition, the increase in. fluorescence was visualized 
by simply laying the completed, unopened PCRs on » UV 
transilhiminator and photographing them through a red 
filter. This is shown in figure SB tor the reactions that 
began with 2 ng male DNA and those with no DNA. 

Befcectiorn off specific allele* of tike human $-globin 
gerae. In order to demonstrate that this approach has 
adequate spcdfidty to allow genetic screening* a detection 
of the srcklc-ccil anemia mutation was performed- F$gu*c 
4 shows the fluorescence from completed amplifications 
containing EtBr (0.5 ^g/ml) detected by photography 
of the reaction cubes on a UV transillaminaior. These 
reactions were performed using primers specific for ci- 
ther the wild-type or sickle-cell mutation of the human 
p-globin gene* . The specificity for each allele is imparted 
by placing the sickle-mutation site at the terminal V 
nucleotide of one primer. By using an appropriate primer 
annealing temperature, primer extension — and thus ant- 
pliftcatiork — can take place only if the 3' nucleotide of the 
primer is coniojemcntary to the p-globin allele present*' ^ 

Each pair of amplications shown in Figure 4 consists of 
a reaction with either the wild-type allelic specific (left 
tube) or skklc-allele specific (right tube) primers. Three 
different DNAs were tyjped: DNA from a homozygous, 
wiid-typc p-gfobm individual (A A); from a heterozygous 
sickle p-gljobin individual (AS); and from a homozygous 
sickle p-globb individual (SS). Each DNA (50 ng genomic 
DNA to start e&ch PGR) was analyzed in triplicate (3 pairs 
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f reactions each). The DNA .type vas reflected in the 
rtlative fluorescexice intensities in each pair of completed 
^pltfkatioiw. There was a significant increase in fluorea* 
Lr»or only where a ^globin allele DNA matched the 
primer set. When measured on a spcctrcfluoromctcr 
{data ^ ot shown)* this fluorescence was about three times 
j^t present in a PCR where both p-globm alleles were 
misia atehed to the primer set. Gel clcc^phore*)* (not 
fihown) established that this increase in fluorescence was 
due to the synthesis of nearly a microgram of a DNA 
fragment of the expected size for p^lobin. There waa 
little synOiestt of dsDNA in reactions in. which the allele- 
specific primer was mismatched to both alleles. 

Conrinvom-9 monitoribpg of a PGR* Using a fiber optic 
device? 5i h possible to direct excitation illuaunaiion from 
,t spectrofluorometcr to a PCR undergoing thcrmocycling 
a nd to retirrn its fluorescence to the Kpectrofluioronieter* 
The fluorescence readout of such an arrangement, di- 
rected Rt an EtBr-containing amplification of Y-chroxno- 
M>mc $pcci6c scqveoccs rrom 25 ng of Wman male DNA* 
is shown in Figure 5. The readout from a control $CR 
v iih no target DNA is also shown. Thirty cycles of PCR 
were monitored for each- 

The rruorc5Ccnrc trace as a function of time dearly 
shows the effect of the thermocyding. Fluorescence inten- 
sity rises and. fails inversely with temperature The fluo- 
rescence intensity is minimum at the denaturation tem- 
perature (94°C) and maximum at the annealin g/exten&ion 
temperature (50°C). In the negative-control PCR, these 
fluorescence maxima and minima do not change signifi- 
cantly over the thirty tbcrraocycJes, indicating that there is 
tittle dsDNA synthesis without the appropriate target 
DNA, and there is little if any bleaching of EtBr during 
the continuous illumination of the sample. 

Jn the PCR containing niaie DNA, the fluorescence 
maxima at the annealing/extension temperature begin to 
increase al about 4000 seconds of thcnnotycling, and 
continue to increase with rime, indicating that dsDNA is 
being produced at a detectable level. Note that the fluo- 
rescence minima at the denaturation temperature do not 
significantly increase, presumably because at thh temper- 
ature there is no dsDNA for EtBr to bind. Unw the course 
of the amplification is followed by tracking the fluorcs-. 
ecrjee increase at the annealing temperature. Analysis of 
the products of these two amplications by gel clectropho- 
rews showed a DNA fragment of the cicpcctcd size for the 
male DNA containing sample and no detectable DNA 
synthesis for the control sampte* 

DISCUSSION 

Downstream processes such as hybridization to a se- 
quence-specific probe can enhance die specificity of DNA 
detetAluu b? FCR. The enmioatkm of u\csc proccraca. 
means that' the specificity of this homogeneous assay 
depends solely on ihat of PCR. In the case of sickle-cell 
disease, we have shown that PCR alone has sufficient DNA 
sequence specificity to permit genetic screening. Using 
appropriate amplification conditions, there is little non* 
specific production of dsDNA in the absence of the 
appropriate target allele* 

The specificity required to detect pathogens can be 
more or less than that required to do genetic screening, 
depending on the number of pathogens in the sample and 
the amount of other DNA that must be taken with the 
sample. A difficult target is HIV, which TcquiTcs detection 
of a viraJ genome that can be at the level of a few copies 
per thoxisands of host ceils 6 . Compared with generic 
screening, which is performed on cells containing at least 
one copy of the target sequence, HIV ^detection requires 
both more specrfioty and the inpxit of more toraJ 
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Homozygous 



Heterozygous 

AS 



Homozygous 

ss 



l <3 UV photography of PCR tubes containing amplications 
using EtBr ih?t are specific to wild-type (A) or *iem (5> alleles of 
the fnmuiD £-globin gene. The Mt of each pair of tubes contains 
aHele^pedfic primers to the wild-type afleks. the right lube 
priaww to the sfcWe aBele- The photograph vas taken after SO 
cycles of PCR, and the input DNAs and the alkies thev contain 
are indicated- Fifty tog of DNA was used to beein PCR. Typmg 
was done in triplicate (8 pain of PC&O for each input DNA: 



25°C 



20 ng of mate DNA 



25 C 




no DNA control 
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0 2000 4000 6000 8000 
time (sec) 

WGBJCat S Contittttous, real-time monitoring of a FCR. A fiberoptic 
was wed to carry, excitation light to a PCJR in progress and also 
emKtsd light bad to a fluoromctcr (bcc Experimental Prttocd). 
AmpMficaSon iwieg human malo-DNA spcofic primers in a PCR 
Starting with tO ng of human male DNA <top) r or Jn a control 
PCR wtthout DNA (bottom), were, monitored. Thutv cyde? of 
PCR were followed for each. The tcmpcraxim: cycled between 
94*C (denatuTation) and 50*C (annealing and extenwan). Note m 
the male DNA PCR, . the cycle (time) dependent increase m 
fluorescence at the aoneaHaf/extension temperature. 
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DN A — lip to microgram amounts — in order to have suf- 
ficient munbers of target sequences. This large amount of 
starting DNA hi an amplification signitjc^ntly increases 
the background fluorescence over which any additional 
fluorescence produced by PCR must be detected. An 
additional complication that occurs with targets m low 
copy-number is the formation of the "primer-dimer" 
artifact. This is the result of the extension of one primer 
using the other primer a* a template. Although this occurs 
infrequently, once it occurs the extension product is a 
substrate for PCR amplification, and can compete with 
true JPCR targets if those targets are rare, f ne primer- 
d iiner product is of course dsDNA and thus is a potential 
source of false signal In this homogeneous a&$ay. 

To increase PCR specificity and reduce the effect of 
prirner-dirncr amplification, we are investigating a num- 
ber of apjpmach.es, including the use of nested-primer 
amplification* that take place in a single tube*, and the 
4t hot -start", in which nonspecific amplification is reduced 
by raising the temperature of the reaction before DNA 
synthesis begins**. Preliminary results usKng these ap- 
proaches suggest that T>TW3ncr-dijn:cT is effectively reduced 
and it is possible to detect the increase in EtBr fluores- 
cence in a PCR instigated by a single HIV genome In a 
background of 10 9 Celts. With larger numbers of ccHs, the 
background fluorescence contributed by genomic DNA 
becomes problematic. To reduce this background, it may 
be possible to use sequence -specific DNA-binding dyes 
that can be made to preferentially bind PCR product over 
genomic DNA by incorporating the dye-binding DNA 
sequence into the PCR product through a 5' "add-on" to , 
the oUronvdcotidc primer*' 1 . 

We nave shown that the detection of fluorescence 
generated by an EtBr-containing PCR is straightforward! 
both once PCR is completed and continuously during 
thermocycHng. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
promising aspect of this assay. The Huorescence analysis 
of completed PCRs is already possible with cxbtrqg instru- 
mentation in 96-veII format* *. In tlus format, the fluores- 
cence in each PCR can be <juantitated before, after* and 
even at selected points durmg therraocychng by moving 
die rack of PCRs to a 96-microwcH plate fluorescence 
reader 2 * 

The instrumentation necessary to continuously monitor 
multiple PCRs simultaneously is also simple in .principle. 
A direct extension of the apparatus used here is to have 
multiple Hberopdcs transmit the excitation light and flu- 
orescent emissions to and from multiple PCRs. The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target DNA copy number- Figure 5 shows that 
the larger the amount of starting target DNAj the sooner 
during PCR a fluorasrenee incrca.se is detected. PrcJirai- 
nary experiments {Higuchi and DolHnger, tnanuscripS in 
preparation) with continuous monitoring have shown a 
sensitivity to two-fold differences in initial target DNA 
concentration. 

Conversely, if the number of target molecules is 
known — as it can be in genetic screemng— continuous 
monitoring may provide a means pf detecting false posi- 
tive and false negative result*. With a known number of 
target molecules, a true positive would exhibit detectable 
fluorescence by a predictable number of cycles of PCR. 
Increases in fiuorcsccnce detected before or after, that 
cycle would indicate potential artifacts. False negative 
results due to, for example,. inhibition of DNA polymer- 
ase, may be detected by including within each PCR an 
inefficiently amplifying marker. This marker Tcsults in a 
fluorescence increase only after a large number of cy- 
cles— many more than arc necessary to detect a true 
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positive. If a sample fails to have a fluorescence increase 
alter this many cycles, inhibition may be suspected. Since, 
in this assay, conclusions are drawn based on the presence 
or absence of fhmrcscence signal alone, such controls rnay 
be important. In any event before any test based on this 
principle is ready for the clink, an assessment of its false 
positive/false negative rates will need to be obtained using 
a large number of known samples. 

In summary, the inclusion m PCR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes it 
possible to detect specific DNA amplification from outside 
the PCR tube. In the future, instruments based upon this 
principle may facilitate the more widespread use of PCR 
in applications that demand the high throughput of 
samples* 

EXPERIMENTAL PROTOCOL 

Hainan HLA-DQ« geoe 6mpKB«ado»fl ctmtainiag Etifr. 
PCRs were set up in 1 00 volumes confining 10 mM Tris-HCK 
pH 8.3; 50 mM KC1: * roM MgO z : « units of too DNA 
polytaerasc (Pertcm*£}mcr Ccma. Norwalk, CT); 20 piriole each 
of human HlA-DQa gene specific oligonucleotide primers 
GH26 and CH27 19 and approximately 10* copies of DQfr PCK 
product diluted from a previous inaction. Ethidium bromide 
(£*Br; Sigma} was used at the concentrations indicated in Figure 
2. ThcrmocycUng proceeded for 20 cycles tn a model 430 
thcrroocyctcr <Perkm-Elroer Cctuft, Norwalk, CT) using a "stcp- 
cyclc" program of 94*C for 1 min- denature tion and 6<rC for 30 
sec. annealing and 72 9 C for 30 kc, extension. 

Y-chromoswmc specific PCSL PCRs (J00 yl total reaction 
volume) contaiuing l)-5 >i£Ad1 EtBr were prepared as described 
for HLA-DQci, except with dhTcrcnt primers and target DNAs. 
These PCRs contained 1 5 pmote each male DNA**pcci% primes 
YI. 1 and Vl.2*°, and cither 60 ng male, 60 Og female, 2 ng male, 
or no human DNA, Thermocytling wasM 6 C?or 1 min- and oOX: 
for ) min using a "rtcp-cyde w program. The number of cycles for 
a sample were as indicated in Figure 3. Fluorescence measure- 
mem is described below. 

AlleJc-apccificy bamaai fft-glo&in PCR* AropUficarions of 
100 pi vpJume' «rm£ 0-5 of XtBr were prepared a« 

described for HLA-DQoi above except with different primers and 
target DNAs. These PCRs contained either, primer pair HOPS' 
KpHA {wild-type glob*n spedBc primers) or HGP2/HpMS (sick- 
lc-globin spcofic primers) at 10 pmole ei»ch primer per PCR. 
These primers were developed Wu ct aL 21 . Three different 
target TjNAa tvere used in separate a mptificatk)n& — 50 ng cacU of 
human DNA that was homozygous for the sfcWte trait (5S), DMA 
that was heterozygous for the skkle traH (A$>» or DNA that was 
homozygous for the Vt.l- gJobin (AA). ThcrmocycBug was for SO 
cycles at 94X1 for 1 min. and 5VC for 1 min. usb| 3 •'stcp-cYd*'' 
program. An suineaHng tfiropcrature of 55*C bad beer* shown vy 
Wu et al. 2J to provide allcJe-spccinc atppiiftcation. Completed 
PCR$ were photographed through a red fitter <Wrauen 23A) 
after placing the reaction tubes atop a model TM-36 transiHunfti- 
nator (tjV-products Sah Gabriel, CA>. 

Fhior«»<«nee measnrancxit Huor wcei>cc rDcasurcracnis went 
mad* oh PCRs containing Et»r in a Fluorolog*2 0tioromCtCT 
(SFEJC Ediswn, NJ). tbedurtion was at the 500 did band with 
about 2 nm bandwidth with * GO 435 nm cut-off filter jMcltes 
Grist, inc.. Irvine. CA) to exclude secend-order light. Eroiued 
light was detected at 570 nm with a bandwiddi of about 7 nm. An 
OG 530 pm cut-off fitter Was used to remove the excitation hgfct 

ContitHtoufi ftnoraiqeence inoniioring of PCR, Continuous 
monitoring of a PCR in progress was accomplisbed using mX 
Gpectrofluorometcr and setunga descxrbod above as well as a 
fiberoptic accessory (SP£X cat no. 1950) to both send excuauon 
fight to, and receive emitted light from, a PCR placed in a well ra 
a model 480 rhfj*nK>cyc!cr (PcrJcm-Elmer Cetus). The probe end 
of the fiberoptic cable was attached with "5 minutc-cpoxy* to tHt 
open top of a PCR tube (a 0.5 ml potypropy 1 " 111 centrifuge tube 
with its cap removed) effectively ncalrng The exposed top i Ot 
the PCR tuoe and the end of the fiberoptic cable were sliicldcd 
from room light and the room light* were kept dimmed during 
each run- The monitored PGR was an ampliricaiion of V-coto* 
rnosornc-spcdrk repeat sequences as described above, except 
using. an anncating/extenskm wmperauirc of 50°C, The reacuoo 
was covered with irr»ijx^l oil (2 drop©) to prevent cvapdration. 
Tberraocycnng and fluorescence measurement were started si* 
multancously. A time-base scan with a 10 second integrarioiif t»ne 
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usta used and ihc *mbsk>n signal was ratiocd to'tbr excitation 
nigrtftJ to control for changes Jight-iourcc intercity, jtata.wcrc 
fleeted using the drn3030f, version 15 (SPEX) data system. 
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IMMUNO BIOLOGICAL LABOflATQRJES 



Trauma, Shock and Sepsis 




The CD- 14 molecule is' expressed on the surfa.ee of 
monocytes and some macrophages. Membrane- 
bound CD -14 is a receptor for lipopotysaccharide 
(LPS) complexed to LPS-Binding-Protein (LBP). The 
concentration of te soluble form is aitefed uncter 
certain pathological conditions. There- is evidence for 
an important role of $CD-14.with polytrauma, sepsis, 
burnings and inflammations. 
During septic conditions and acute infections it seems 
to be a prognostic marker and is therefore of value in 
monftorlng these patients. 



1BL offers an ELISA for quantitative determination of 
soluble CD-14 in human serum, -plasma, ceil-cutture 
supernatants and other biological fluids. 
Assay features: 12x8 determinations 

(microliter strips), 

precoated with 3 specific 

monoclonal antibody, 

2x1 hour incubation, 

standard range: 3-96 ng/mi 

detection limit: 1 ng/m! 

CV: intra- and interassay < 8% 



For more information call of fax 
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^Oligonucleotides with Fluorescent Dyes at 
§ Opposite Ends Provide a Quenched Probe 
uj System Useful for Detecting PCR Product 
jgj and Nucleic Acid Hybridization 

$ Kenneth J. Livak, Susan J.A. Flood, Jeffrey Marmaro, William Giusti, and Karin Deetz 

<yj Perkin-EImer, Applied Biosystems Division, Foster City, California 94404 
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The 5' nuclease PCR assay detects the 
accumulation of specific PCR product 
by hybridization and cleavage of a 
double-labeled fluorogenlc probe 
during the amplification reaction. 
The probe Is an oligonucleotide with 
both a reporter fluorescent dye and a 
quencher dye attached. An Increase 
In reporter fluorescence Intensity in- 
dicates that the probe has hybridized 
to the target PCR product and has 
been cleaved by the 5' ->3' nucle- 
oiytlc activity of Too DNA polymerase. 
In this study, probes with the 
quencher dye attached to an internal 
nucleotide were compared with 
probes with the quencher dye at- 
tached to the 3 '-end nucleotide. In ail 
cases, the reporter dye was attached 
to the 5' end. All Intact probes 
showed quenching of the reporter 
fluorescence* In general, probes with 
the quencher dye attached to the 3'- 
end nucleotide exhibited a larger sig- 
nal In the 5' nuclease PCR assay than 
the Internally labeled probes. It is 
proposed that the larger signal Is 
caused by Increased likelihood of 
cleavage by Toq DNA polymerase 
when the probe Is hybridized to a 
template strand during PCR. Probes 
with the quencher dye attached to 
the 3 '-end nucleotide also exhibited 
an Increase in reporter fluorescence 
Intensity when hybridized to a com- 
plementary strand. Thus, oligonucle- 
otides with reporter and quencher 
dyes attached at opposite ends can 
be used as homogeneous hybridiza- 
tion probes. 



homogeneous assay for detecting 
the accumulation of specific PCR prod- 
uct that uses a double-labeled fluoro- 
genic pxobe was described by Lee et al. (1) 
The assay exploits the 5'-* 3' nucle- 
olytic activity of Taq DNA poly- 
merase (2 * 3) and is diagramed in Figure 1. 
The fluorogenic probe consists of an oli- 
gonucleotide with a reporter fluorescent 
dye, such as a fluorescein, attached to 
the 5' end; and a quencher dye, such as a 
rhodamine, attached internally. When 
the fluorescein is excited by irradiation, 
its fluorescent emission will be 
quenched if the rhodamine is close 
enough to be excited through the pro- 
cess of fluorescence energy transfer 
(FET). (4 - 5) During PCR, if the probe is hy- 
bridized to a template strand, Taq DNA 
polymerase will cleave the probe be- 
cause of its inherent 5' -* 3' nucleolytlc 
activity. If the cleavage occurs between 
the fluorescein and rhodamine dyes, it 
causes an increase in fluorescein fluores- 
cence intensity because the fluorescein 
is no longer quenched. The increase in 
fluorescein fluorescence intensity indi- 
cates that the probe-specific PCR product 
has been generated. Thus, FET between a 
reporter dye and a quencher dye is criti- 
cal to the performance of the probe in 
the 5' nuclease PCR assay. 

Quenching is completely dependent 
on the physical proximity of the two 
dyes. (6) Because of this, it has been as- 
sumed that the quencher dye must be 
attached near the 5 r end. Surprisingly, 
we have found that attaching a rho- 
damine dye at the 3' end of a probe 
still provides adequate quenching for 
the probe to perform in the 5' nuclease 



PCR assay. Furthermore, cleavage of this 
type of probe is not required to achieve 
some reduction in quenching. Oligonu- 
cleotides with a reporter dye on the 5' 
end and a quencher dye on the 3' end 
exhibit a much higher reporter fluores- 
cence when double-stranded as com- 
pared with single-stranded. This should 
make it possible to use this type of dou- 
ble-labeled probe for homogeneous de- 
tection of nucleic acid hybridization. 



MATERIALS AND METHODS 
Oligonucleotides 

Table 1 shows the nucleotide sequence 
of the oligonucleotides used in this 
study. Linker arm nucleotide (LAN) 
phosphoramidite was obtained from 
Glen Research. The standard DNA phos- 
phoramidites, 6-carboxyfluorescein (6- 
FAM) phosphoramidite, 6-carboxytet- 
ramethyirhodamine succinimidyl ester 
(TAMRA NHS ester), and Phosphalink 
for attaching a 3 '-blocking phosphate, 
were obtained from Perkin-EImer, Ap- 
plied Biosystems Division. Oligonucle- 
otide synthesis was performed using an 
ABI model 394 DNA synthesizer (Applied 
Biosystems). Primer and complement 
oligonucleotides were purified using 
Oligo Purification Cartridges (Applied 
Biosystems). Double-labeled probes were 
synthesized with 6-FAM-labeIed phos- 
phoramidite at the 5' end, LAN replacing 
one of the T's in the sequence, and Phos- 
phalink at the 3' end. Following de- 
piotection and ethanol precipitation, 
TAMRA NHS ester was coupled to the 
LAN-containing oligonucleotide in 250 
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FIGURE 1 Diagram of 5' nuclease assay. Stepwise representation of the 5' - 3' ™<^< *<- 
nlity of T«J DNA polymerase acting on a fluorogenic probe during one extension phase of POL 



mM Na-bicaibonate buffer (pH 9.0) at 
room temperature. Unreacted dye was 
removed by passage over a PD-10 Sepha- 
dex column. Finally, the double-labeled 
probe was purified by preparative high- 
performance liquid chromatography 
(HPLC) using an Aquapore C 8 220x4.6- 
mm column with 7-p.m particle size. The 
column was developed with a 24-min 
linear gradient of 8-20% acetonitrile in 
0.1 m TEAA (triethylamine acetate). 
Probes are named by designating the se- 
quence from Table 1 and the position of 
the IAN-TAMRA moiety. For example, 
probe Al-7 has sequence Al with LAN- 
TAMRA at nucleotide position 7 from the 
5' end. 



PCR Systems 

Ail PCR amplifications were performed 
in the Perkin-Elmer GeneAmp PCR Sys- 
tem 9600 using 50-ul reactions that con- 
tamed 10 mM Tris-HCl (pH 8.3), 50 mM 
KC1, 200 pM dATP, 200 u-M dCTP, 200 \iu 
dGTP, 400 |xM dUTP, 0.5 unit of AmpEr- 
ase uracil N-glycosylase (Perkin-Elmer), 
and 1.25 unit of AmpHTaq DNA poly- 
merase (Perkin-Elmer). A 295-bp seg- 
ment from exon 3 of the human p-actin 



gene (nucleotides 2141-2435 in the se- 
quence of Nakajlma-Ii|ima et al.) (7) was 
amplified using primers AFP and ARP 
(Table 1), which are modified slightly 
from those of du Breuil et al, (8) Actin am- 
plification reactions contained 4 mM 
MgCI 2 , 20 ng of human genomic DNA, 
50 nM Al or A3 probe, and 300 nM each 



TABLE 1 Sequences of Oligonucleotides 



primer. The thermal regimen was SO^C 
(2 min), 95°C (10 min), 40 cycles of 95°C 
(20 sec), 60°C (1 min), and hold at 72°C 
A 515-bp segment was amplified from a 
plasmid that consists of a segment of X 
DNA (nucleotides 32,220-32,747) in- 
serted in the Smal site of vector pUC119. 
These reactions contained 3.5 mM 
MgCl* 1 ng of plasmid DNA, 50 nM P2 or 
P5 probe, 200 nM primer F119, and 200 
nM primer R119. The thermal regimen 
was 50°C (2 min), 95°C (10 min), 25 cy- 
cles of 95°C (20 sec), 57°C (I min), and 
hold at 72°C 



Fluorescence Detection 

For each amplification reaction, a 40-u,l 
aliquot of a sample was transferred to an 
individual well of a white, 96-well micro- 
liter plate (Perkin-Elmer). Fluorescence 
was measured on the Perkin-Elmer Taq- 
Man LS-50B System, which consists of a 
luminescence spectrometer with plate 
reader assembly, a 485-nm excitation fil- 
ter, and a 515-nm emission filter. Excita- 
tion was at 488 nm using a 5-nm silt 
width. Emission was measured at 518 
nm for 6-FAM (the reporter or R value) 
and 582 nm for TAMRA (the quencher or 
Q value) using a 10-nm slit width. To 
determine the increase in reporter emis- 
sion that is caused by cleavage of the 
probe during PCR, three normalizations 
are applied to the raw emission data. 
First, emission intensity of a buffer blank 
is subtracted for each wavelength. Sec- 
ond, emission intensity of the reporter is 



Name 



Type 



Sequence 



F119 
R119 
P2 
P2C 
PS 
P5C 
AFP 
ARP 
Al 
A1C 
A3 
A3C 



primer 
primer 
probe 

complement 
probe 

complement 
pnmex 
primer 
probe 

complement 
probe 

complement 



ACCCACAGGAACTGATCACCACTC 
ATGTCGCGTTCCGGCTGACGTTCTGC 
TCGCATTACTGATCGTIXjCCAACCAGTJ) 
GTACTGGTTGGCAACGATCAGTAATGCGATG 

CGGA'ITTGCTGGTATCTATGACAAGGATp 
TrCATCCTTGTCATAGATACCAGCAAATCCG 

TCACCCACACTGTGCCCATCTACGA 
CAGCGGAACCGCrCATTGCCAATGG 
ATGCCCTCCaXATGCCATCCTGCGTp 
AGACGCAGGATGGCATGGGGGAGGGCATAC 

CGCCC TGGACrrCGAGCAAGAGABp 
CCATCTCTTGCTCGAAGTCCAGGGCGAC ^ 



For each oligonucleotide used in this study, the nucleic acid sequence » " 2 

5' - 3' direction. There are three types of oligonucleotides: PCR primer, ^^^^g 
in the 5' nuclease assay, and complement used to hybridize to the corresponding probe. For we 
£ota?t£ 'underlined base indicates a posluon where UN with TAMRA attached was subsri 
tuted for a T. (p) The presence of a 3' phosphate on each probe. 
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A 1 -2 RaQgccctcccccatgccatcctgcgtp 

A1 -7 ^ATGCCCQC'CCCCArGCCATCCTGCGTp 

A1 -1 4 HatgccctcccccaQgccatcctgcgtp 

A1 -1 © SatgccctcccccatgccaQcctgcgtp 

A1 -22 Ratgc^ctcccccatgccatccQgogtp 

A1 -25 RATGCCCTCCCCCATGCCATCCTGCGQp 





513 nm 


582 nm 


RQ" 


RQ+ 


ARQ 




no temp. 


♦ temp. 


no temp. 


+ temp. 








a — 

)A1-2 


25.5 ±2.1 


32.7 ±1.9 


38.2 + 3.0 


38.2 ±2.0 


0.67 + 0.01 


0.86 ±0.06 


0.19 ±0.06 


A1-7 


53.5 ± 4.3 


.395.1 ±21.4 


108.5 ±6.3 


110.3 ±5.3 


0.49 ± 0.03 


3.58 ±0.17 


3.09 ±0.18 


A1-14 


127.0 ±4.9 


403.5 ±19.1 


109.7 ±5.3 


93.1 ±6.3 


1.16 ±0.02 


4.34 ±0.15 


3.18 ±0.15 


A1-19 


187.5 ±17.9 


422.7 ±7.7 


70.3 ±7.4 


73-0 ±2.8 


2.67 ±0.05 


5.80 ±0.15 


3.13 + 0.16 


A1-22 


224.6 ±9.4 


482.2 ±43.6 


100.0 ±4.0 


96.2 + 9.6 


2.25 ±0.03 


5.02 + 0.11 


2,77 ±0.12 


A1-26 


160.2 ±8.9 


454.1 ±18.4 


93.1 ±5.4 


90.7 ±3.2 


1.72 ±0.02 


5.01 ±0.08 


3.29 ±0.08 



FIGURE 2 Results of 5' nuclease assay comparing 0-actin probes with TAMRA at different nucle- 
otide positions. As described in Materials and Methods, PGR amplifications containing the in- 
dicated probes were performed, and the fluorescence emission was measured at 518 and 582 nm. 
Reported values are the average ±1 s.D. for six reactions run without added template (no temp.) 
and six reactions run with template C+temp.). The RQ ratio was calculated for each individual 
reaction and averaged to give the reported RQ" and RQ + values. 



divided by the emission intensity of the 
quencher to give an RQ ratio for each 
reaction tube. This normalizes for well- 
to-well variations in probe concentra- 
tion and fluorescence measurement. Fi- 
nally, ARQ is calculated by subtracting 
the RQ value of the no-template control 
(RQ") from the RQ value for the com- 
plete reaction including template 
(RQ*). 



A series of probes with increasing dis- 
tances between the fluorescein reporter 
and rhodamine quencher were tested to 
investigate the minimum and maximum 
spacing that would give an acceptable 
performance in the 5' nuclease PCR as- 
say. These probes hybridize to a target 



sequence in the human p-actin gene. 
Figure 2 shows the results of an experi- 
ment in which these probes were in- 
cluded in PCR that amplified a segment 
of the p-actin gene containing the target 
sequence. Performance in the 5' nu- 
clease PCR assay is monitored by the 
magnitude of ARQ, which is a measure 
of the increase in reporter fluorescence 
caused by PCR amplification of the 
probe target. Probe Al-2 has a ARQ value 
that Is close to zero, indicating that the 
probe was not cleaved appreciably dur- 
ing the amplification reaction. This sug- 
gests that with the quencher dye on the 
second nucleotide from the 5' end, there 
is insufficient room for Taq polymerase 
to cleave efficiently between the reporter 
and quencher. The other five probes ex- 
hibited comparable ARQ values that are 



clearly different from zero. Thus, all five 
probes are being cleaved during PCR am- 
plification resulting in a similar increase 
in reporter fluorescence. It should be 
noted that complete digestion of a probe 
produces a much larger increase in re- 
porter fluorescence than that observed 
in Figure 2 (data not shown). Thus, even 
in reactions where amplification occurs, 
the majority of probe molecules remain 
uncleaved. It is mainly for this reason 
that the fluorescence intensity of the 
quencher dye TAMRA changes little with 
amplification of the target. This is what 
allows us to use the 582-nm fluorescence 
reading as a normalization factor. 

The magnitude of RQ" depends 
mainly on the quenching efficiency in- 
herent In the specific structure of the 
probe and the purity of the oligonucle- 
otide. Thus, the larger RQ" values indi- 
cate that probes Al-14, Al-19, Al-22, and 
Al-26 probably have reduced quenching 
as compared with Al-7. Still, the degree 
of quenching is sufficient to detect a 
highly significant increase in reporter 
fluorescence when each of these probes 
is cleaved during PCR. 

To further investigate the ability of 
TAMRA on the 3' end to quench 6-FAM 
on the 5' end, three additional pairs of 
probes were tested in the 5' nuclease 
PCR assay. For each pair, one probe has 
TAMRA attached to an internal nucle- 
otide and the other has TAMRA attached 
to the 3' end nucleotide. The results are 
shown in Table 2. For all three sets, the 
probe with the 3' quencher exhibits a 
ARQ value that is considerably higher 
than for the probe with the internal 
quencher. The RQ" values suggest that 
differences in quenching are not as great 
as those observed with some of the Al 
probes. These results demonstrate that a 
quencher dye on the 3' end of an oligo- 
nucleotide can quench efficiently the 



TABLE 2 Results of 5' Nuclease Assay Comparing Probes with TAMRA Attached to an Internal or 3'-terminal Nucleotide 



518 nm 



582 nm 



Probe 


no temp. 


+ temp. 


no temp. 


+ temp. 


RQ- 


RQ + 


ARQ 


A3-6 


54.6 ± 3.2 


84.8 ± 3.7 


116.2 ±6.4 


115.6 ±2.5 


0.47 ± 0.02 


0.73 ± 0,03 


0.26 ± 0.04 


A3-24 


72.1 ± 2.9 


236.5 ±11.1 


84.2 ± 4.0 


90.2 ± 3.8 


0.86 ± 0.02 


2.62 ± 0.05 


1.76 ± 0.05 


P2-7 


82.8 ± 4.4 


384.0 ±34.1 


105.1 ± 6.4 


120.4 ± 10.2 


0.79 ± 0.02 


3.19 ±0.16 


2.40 ± 0.16 


P2-27 


113.4 ± 6.6 


555.4 ± 14.1 


140.7 ± 8.5 


118.7 ± 4.8 


0.81 ± 0.01 


4.68 ± 0.10 


3.88 ± 0.10 


PS-10 


77.5 ± 6.5 


244.4 ± 15.9 


86.7 s 4.3 


95.8 ±6.7 


0.89 ± 0.05 


2.55 ± 0.06 


1.66 ±0.08 


P5-28 


64.0 ± 5.2 


333.6 ±12.1 


100.6 ± 6.1 


94.7 ± 6.3 


0.63 ± O.02 


3.53 ± 0.12 


2.89 ± 0.13 



fa 



Reactions containing the indicated probes and calculations were performed as described in Material and Methods and in the legend to Fig. 2. 
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fluorescence of a reporter dye on the 5' 
end. The degree of quenching is suffi- 
cient for this type of oligonucleotide to 
be used as a probe in the 5' nuclease PCR 
assay. 

To test the hypothesis that quenching 
by a 3' TAMRA depends on the flexibility 
of the oligonucleotide, fluorescence was 
measured for probes in the single- 
stranded and double-stranded states. Ta- 
ble 3 reports the fluorescence observed 
at 518 and 582 nm. The relative degree 
of quenching is assessed by calculating 
the RQ ratio. For probes with TAMRA 
6-10 nucleotides from the 5' end, there 
is little difference in the RQ values when 
comparing single-stranded with double- 
stranded oligonucleotides. The results 
for probes with TAMRA at the 3' end are 
much different. For these probes, hy- 
bridization to a complementary strand 
causes a dramatic increase in RQ. We 
propose that this loss of quenching is 
caused by the rigid structure of double- 
stranded DNA, which prevents the 5' 
and 3' ends from being in proximity. 

When TAMRA is placed toward the 3' 
end, there is a marked Mg 2 "*" effect on 
quenching. Figure 3 shows a plot of ob- 
served RQ values for the Al series of 
probes as a function of Mg 2 * concentra- 
tion. With TAMRA attached near the 5' 
end (probe Al-2 or Al-7), the RQ value at 
OmMMg 2 * is only slightly higher than 
RQ at 10 mM Mg 2+ . For probes Al-19, 
Al-22, and Al-26, the RQ values at 0 mM 
Mg 2 * are very high r indicating a much 



reduced* quenching efficiency. For each 
of these probes, there is a marked de- 
crease in RQ at 1 mM Mg 2 * followed by 
a gradual decline as the Mg 2+ concen- 
tration Increases to 10 mM. Probe Al-14 
shows an intermediate RQ value at 0 mM 
Mg 2 * with a gradual decline at higher 
Mg 2 * concentrations. In a low-salt en- 
vironment with no Mg 2+ present, a sin- 
gle-stranded oligonucleotide would be 
expected to adopt an extended confor- 
mation because of electrostatic repul- 
sion. The binding of Mg 2+ ions acts to 
shield the negative charge of the phos- 
phate backbone so that the oligonucle- 
otide can adopt conformations where 
the 3' end is close to the 5' end. There- 
fore, the observed Mg 2 * effects support 
the notion that quenching of a 5' re- 
porter dye by TAMRA at or near the 3' 
end depends on the flexibility of the oli- 
gonucleotide. 



The striking finding of this study is that 
it seems the rhodamlne dye TAMRA, 
placed at any position in an oligonucle- 
otide, can quench the fluorescent emis- 
sion of a fluorescein (6-FAM) placed at 
the 5' end. This implies that a single- 
stranded, double-labeled oligonucle- 
otide must be able to adopt conforma- 
tions where the TAMRA is close to the 5' 
end. It should be noted that the decay of 
6-FAM in the excited state requires a cer- 
tain amount of time. Therefore, what 



TABLE 3 Comparison of Fluorescence Emissions of Single-stranded and 
Double-stranded Fluorogenic Probes 



518 nm 



582 nm 



RQ 



Probe 



ss 



ds 



ds 



ds 



Al-7 

Al-26 

A3-6 

A3-24 

P2-7 

P2-27 

P5-10 

P5-28 



27.75 
43.31 
16.75 
30.05 
35.02 
39.89 
27.34 
33.65 



68.53 
509.38 

62.88 
578.64 

70.13 
320.47 
144.85 
462.29 



61.08 
53.50 
39.33 
67.72 
54.63 
65.10 
61.95 
72.39 



138.18 
93.86 
165.57 
140.25 
121.09 
61.13 
165.54 
104.61 



0.45 
0.81 
0.43 
0.45 
0.64 
0.61 
0.44 
0.46 



0.50 
5.43 
0.38 
3.21 
0.58 
5.25 
0.87 
4.43 



(ss) Sinsle-stranded. The fluorescence emissions at 518 or 582 nm foi solutions containing a final 
concentration of 50 nw indicated probe, 10 mM Tris-HCl (pH 8.3), 50 mM KC1, and 10 mM Mgd 2 . 
(ds) Double-stranded. The solutions contained, in addition, 100 nM A1C for probes Al-7 and 
Al-26 100 nM A3C for probes A3-6 and A3-24, 100 nM P2C for probes P2-7 and P2-27, or 100 dm 
P5C for probes P5-10 and P5-28. Before the addition of MgCl* 120 jxl of each sample was heated 
at 95°C for 5 min. Following the addition of 80 |d of 25 mM MgCl 2 , each sample was allowed to 
cool to room temperature and the fluorescence emissions were measured. Reported values are 
the average of three determinations. 



matters for quenching is not the average 
distance between 6-FAM and TAMRA 
but, rather, how close TAMRA can get to 
6-FAM during the lifetime of the 6-FAM 
excited state. As long as the decay time of 
the excited state is relatively long com- 
pared with the molecular motions of the 
oligonucleotide, quenching can occur. 
Thus, we propose that TAMRA at the 3' 
end, or any other position, can quench 
6-FAM at the 5' end because TAMRA is in 
proximity to 6-FAM often enough to be 
able to accept energy transfer from an 
excited 6-FAM. 

Details of the fluorescence measure- 
ments remain puzzling. For example, Ta- 
ble 3 shows that hybridization of probes 
Al-26, A3-24, and P5-28 to their comple- 
mentary strands not only causes a large 
increase in 6-FAM fluorescence at 518 
nm but also causes a modest increase in 
TAMRA fluorescence at 582 nm, If 
TAMRA is being excited by energy trans- 
fer from quenched 6-FAM, then loss of 
quenching attributable to hybridization 
should cause a decrease in the fluores- 
cence emission of TAMRA. The fact that 
the fluorescence emission of TAMRA in- 
creases indicates that the situation is 
more complex. For example, we have an- 
ecdotal evidence that the bases of the 
oligonucleotide, especially G, quench 
the fluorescence of both 6-FAM and 
TAMRA to some degree. When double- 
stranded, base-pairing may reduce the 
ability of the bases to quench. The pri- 
mary factor causing the quenching of 
6-FAM in an intact probe is the TAMRA 
dye. Evidence for the importance of 
TAMRA is that 6-FAM fluorescence 
remains relatively unchanged when 
probes labeled only with 6-FAM are used 
in the 5' nuclease PCR assay (data not 
shown). Secondary effectors of fluores- 
cence, both before and after cleavage of 
the probe, need to be explored further. 

Regardless of the physical mecha- 
nism, the relative independence of posi- 
tion and quenching greatly simplifies 
the design of probes for the 5' nuclease 
PCR assay. There are three main factors 
that determine the performance of a 
double-labeled fluorescent probe in the 
$' nuclease PCR assay. The first factor is 
the degree of quenching observed in the 
intact probe. This is characterized by the 
value of RQ* , which is the ratio of re- 
porter to quencher fluorescent emis- 
sions for a no template control PCR. In- 
fluences on the value of RQ~ include 
the particular reporter and quencher 
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FIGURE 3 Effect of Mg 2+ concentration on RQ ratio for the Al series of probes. The fluorescence 
emission intensity at 518 and 582 nm was measured for solutions containing 50 nM probe, 10 mM 
Trls-HCl <pH 8.3), 50 mM KC1, and varying amounts (0-10 mM) of MgCi* The calculated RQ 
ratios (518 ran intensity divided by 582 nm intensity) are plotted vs. MgCl 2 concentration (mM 
Mg). The key (upper right) shows the probes examined. 



dyes used, spacing between reporter and 
quencher dyes, nucleotide sequence 
context effects, presence of structure or 
other factors that reduce flexibility of 
the oligonucleotide, and purity of the 
probe. The second factor is the efficiency 
of hybridization, which depends on 
probe T ra# presence of secondary struc- 
ture in probe or template, annealing 
temperature, and other reaction condi- 
tions. The third factor is the efficiency at 
which Taq DNA polymerase cleaves the 
bound probe between the reporter and 
quencher dyes. This cleavage is depen- 
dent on sequence complementarity be- 
tween probe and template as shown by 
the observation that mismatches in the 
segment between reporter arid quencher 
dyes drastically reduce the cleavage of 
probe. (l) 

The rise in RQ" values for the Al se- 
ries of probes seems to indicate that the 
degree of quenching is reduced some- 
what as the quencher is placed toward 
the 3' end. The lowest apparent quench- 
ing is observed for probe Al-19 (see Fig. 
3) rather than for the probe where the 
TAMRA is at the 3' end (Al-26). This is 
understandable, as the conformation of 
the 3' end position would be expected to 
be less restricted than the conformation 
of an internal position. In effect, a 
quencher at the 3' end is freer to adopt 
conformations close to the 5' reporter 
dye than is an internally placed 
quencher. For the other three sets of 



probes, the interpretation of RQ" values 
Is less clear-cut. The A3 probes show the 
same trend as Al, with the 3' TAMRA 
probe having a larger RQ" than the in- 
ternal TAMRA probe. For the P2 pair, 
both probes have about the same RQ" 
value. For the P5 probes, the RQ" for the 
3' probe is less than for the internally 
labeled probe. Another factor that may 
explain some of the observed variation is 
that purity affects the RQ' value. Al- 
though all probes are HPLC purified, a 
small amount of contamination with 
unquenched reporter can have a large ef- 
fect on RQ~. 

Although there may be a modest ef- 
fect on degree of quenching, the posi- 
tion of the quencher apparently can 
have a large effect on the efficiency of 
probe cleavage. The most drastic effect is 
observed with probe Al-2, where place- 
ment of the TAMRA on the second nu- 
cleotide reduces the efficiency of cleav- 
age to almost zero. For the A3, P2, and PS 
probes, ARQ is much greater for the 3' 
TAMRA probes as compared with the in- 
ternal TAMRA probes. This Is explained 
most easily by assuming that probes 
with TAMRA at the 3' end are more likely 
to be cleaved between reporter and 
quencher than are probes with TAMRA 
attached internally. For the Al probes, 
the cleavage efficiency of probe Al-7 
must already be quite high, as ARQ does 
not increase when the quencher is 
placed closer to the 3' end. This illus- 



trates the importance of being able to 
use probes with a quencher on the 3' 
end in the 5' nuclease PGR assay. In this 
assay, an increase in the intensity of re- 
porter fluorescence is observed only 
when the probe is cleaved between the 
reporter and quencher dyes. By placing 
the reporter and quencher dyes on the 
opposite ends of an oligonucleotide 
probe, any cleavage that occurs will be 
detected. When the quencher is attached 
to an internal nucleotide, sometimes the 
probe works well (Al-7) and other times 
not so well (A3-6). The relatively poor 
performance of probe A3-6 presumably 
means the probe is being cleaved 3' to 
the quencher rather than between the 
reporter and quencher. Therefore, the 
best chance of having a probe that reli- 
ably detects accumulation of PCR prod- 
uct in the S' nuclease PCR assay is to use 
a probe with the reporter and quencher 
dyes on opposite ends. 

Placing the quencher dye on the 3 f 
end may also provide a slight benefit in 
terms of hybridization efficiency. The 
presence of a quencher attached to an 
internal nucleotide might be expected to 
disrupt base-pairing and reduce the T m 
of a probe. In fact, a 2°G-3°C reduction 
in T m has been observed for two probes 
with internally attached TAMRAs. (9) This 
disruptive effect would be minimized by 
placing the quencher at the 3' end. Thus, 
probes with 3' quenchers might exhibit 
slightly higher hybridization efficiencies 
than probes with internal quenchers. 

The combination of increased cleav- 
age and hybridization efficiencies means 
that probes with 3' quenchers probably 
will be more tolerant of mismatches be- 
tween probe and target as compared 
with internally labeled probes. This, tol- 
erance of mismatches can be advanta- 
geous, as when trying to use a single 
probe to detect PCR-amplified products 
from samples of different species. Also, it 
means that cleavage of probe during PCR 
is less sensitive to alterations in an- 
nealing temperature or other reaction 
conditions. The one application where 
tolerance of mismatches may be a disad- 
vantage is for allelic discrimination. Lee 
et al. 0> demonstrated that allele-specific 
probes were cleaved between reporter 
and quencher only when hybridized to a 
perfectly complementary target. This al- 
lowed them to distinguish the normal 
human cystic fibrosis allele from the 
AF508 mutant. Their probes had TAMRA 
attached to the seventh nucleotide from 
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the 5' end and were designed so that any 
mismatches were between the reporter 
and quencher. Increasing the distance 
between reporter and quencher would 
lessen the disruptive effect of mis- 
matches and allow cleavage of the probe 
on the incorrect target, thus, probes 
with a quencher attached to an internal 
nucleotide may still be useful for alielic 
discrimination. 

In this study loss of quenching upon 
hybridization was used to show that 
quenching by a 3' TAMRA is dependent 
on the flexibility of a single-stranded oli- 
gonucleotide. The increase in reporter 
fluorescence intensity, though, could 
also be used to determine whether hy- 
bridization has occurred or not. Thus, 
oligonucleotides with reporter and 
quencher dyes attached at opposite ends 
should also be useful as hybridization 
probes. The ability to detect hybridiza- 
tion in real time means that these probes 
could be used to measure hybridization 
kinetics. Also, this type of probe could be 
used to develop homogeneous hybrid- 
ization assays for diagnostics or other ap- 
plications. Bagwell et al. <10) describe just 
this type of homogeneous assay where 
hybridization of a probe causes an in- 
crease in fluorescence caused by a loss of 
quenching. However, they utilized a 
complex probe design that requires add- 
ing nucleotides to both ends of the 
probe sequence to form two imperfect 
hairpins. The results presented here 
demonstrate that the simple addition of 
a reporter dye to one end of an oligonu- 
cleotide and a quencher dye to the other 
end generates a fluorogenic probe that 
can detect hybridization or PGR amplifi- 
cation. 
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Quantitative nucleic acid sequence analysis has 
had an important role in many fields of biologi- 
cal research. Measurement of gene expression 
(RNA) has b««n used extensively In monitoring 
biological responses in various stimuli Clan et al* 
1994; Huang ct at. 1995a,b; Prud'hommo et al. 
1995). Quantitative gene analysis (T;NA) has 
Ix-en used to d«i*:rmine the genome quantity of a 
particular gene, as in the case ot the human H1LX2 
gene, which Is amplified in -30% of breast tu- 
mors (Slamon et al. 1987). Gene and genome 
quantitation (UNA and UNA) also have been used 
for analysis of human immunodeficiency virus 
(11JV) burden demonstrating changes in the lev- 
els of virus throughout the different phases of the 
disease (Connor et al. 1993; Plutak ct al. iw:ih; 
Jantado et al. 1995). 

Many methods have been described for tin: 
quantitative analysis ot nucleic acid sequences 
(both for RNA and DNA; Southern 19/6; Sharp et 
al. 1980; Thomas 1980). Recently, PCR has 
proven to be a powerful tool for quantitative 
nucleic acid analysis. PCR and reverse transcrip- 
tase (Kl>PO have permitted the analysis of 
minimal starting quantities of nucleic acid (as 
little as. one cell equivalent). This has made pos- 
sible many experiments that could not hove been 
performed with traditional methods. Although 
PCR has provided a powerful tool, it is imperative 



thaT » be U5«U properly for quantitation (U»uy- 
maekers 1995). Many early reports of quantita- 
tive PCR and R'i -PCR described quantitation of 
the PCR product but did not measure, the initial 
target sequence quantity. It is essentia] to design 
proper controls for the quantitation of the initial 
target sequences (Fcrrc 1992; Clement I ex al. 

ion?.) 

Ke%*f«rchex5 have developed several methods 
of quantitative PCR and RT-PCR, One approach 
measures PCR product quantity in the log phase 
of the reaction before the plateau (Kellogg et al. 
1990; Pang ct a). 1990). This method requires 
that each sample has equal Input amounts of 

. nucleic- add and that each sample under analysis 
amplifies with klcut 1c.a1 efficiency up to the point 
of quantitative analysis. A gene sequence (con- 
tained in all *amplcs ot relatively constant quan- 
uTi**, such as p-aelln) can be us«d for sample* 
amplification efficiency normalization. Using 
conventional methods of PCR detection and 
quantitation (gel electrophoresis or plate capture 
hybridization), it is exirtmcly laborious to assure 
that all samples are analyzed during the log phase 
of the retutlon (foT bolh the target gene and the 
normalization gene). Another method, quaiilita* 
live competitive (QC)-PCR, has been developed 
and is used widely for PCR quantitation. QC-PCR 
relics 07i the inclusion of an internal control 
competitor in each reaction (Becker-Andre 1991; 
Platak cl oh WW*). The efficiency of each re 
action is normalised to lb* internal competitor. 
a wnnwn aim mul of Internal competitor £an be 

anuiM 7ne« no/ wj «c:t>t 7nn7/cn/7T 
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nddod lo oach sample. To obtain relative nuani- 
urUon, the unknown target PGR product is com- 
pared with the known competitor \K"M product. 
Success of a quantitative competitive VClK assay 
re.1i«s on developing an internal control llml am* 
I »1 i Tti-rs with the same efficiency as ilic Uti^el 
ccuie. The design of Tin* compcutoi and the vali- 
dation of amplification efficiencies it-quire a 
dedicated effort, Huwevci; because QtM'CR docs 
not require that FCU pi inlucts be aiudy^td during 
the Joj; phase of Ibe. amplification, it is tin: easier 
of the two methods to use. 

Several detection system* wie used for quan 
illative 1*CK and RT~l*c:W analysis; U) agarose 
gels, (2) .fluoresce*] I label lug ofK!R products and 
delect Ion with U.-urr- induced fluorescence usUi$ 
capillary electrophoresis (haseo ct al. 1995} Wil- 
liams ct al. 1 996) or acrylajutdc and (3) j dale 
capture*, and sandwich probe hybridan i or t (Mul- 
der e.i ah 1994). Although these method* pmvrd 
successful, each method requires posl-PCR ma- 
Imputations that add time to the analysis and 
may lead to hiboMituty i wnlniriinaiion. The 
sample throughput of Ibesf n id hi id* i> limited 
(w(|)i I lie- t-xtcpllon of the plftle capture ap- 
proach)- and, th«n:foTr., these methods ore not 
well suitrd ft" u.tcn demanding high snmple 
Throughput (I.e., screening of large numbers of 

M4>luwlc*.ult^ wi di lttly/.Lil^ iWilUJ/lva fwi diagnus* 

Ucs or clinical trials). 

1 1 crc we report tin: development of a novel 
iittay for quantitative DNA analysis. The assay is 
based on 1he ust: of the .S' nuch'toSe assay first 
described by Holland et al. (1993). The method 
u.sits the 5' nucleate activity of 7W</ polymerase to 
dc.avc a noncxtcndlhlc hybridization probe dur- 
ing thr extension phase of T'CK- Thu approach 
uses dunl-Iabclcd fluoro^enic hybridisation 
probes (Lcc ct id. 1!>S>3; nasslcr ct al. l.ivok 
ct al, 1$9£o,b). One fluorescent dye nerve-s as a 
reporter |FAM (i.e., Gearbox /fluorescein)! and its 
emission spectra is quenched by the second fluo- 
rescent dye, TAMRA (i.e., n-carboxy-tetramethyl- 
rhodaminc). The nuclease degradation of the hy- 
hrldi/utlon probe releases the quenching of Ihe 
I'AM fluorescent emission, resulting in an In- 
crease h» peak fluorescent emission at S3 hi mn. 
The use or a sequence detector (AUJ Prism) allows 
measurement of fluorescent spectra of ail wells 
of the thermal cycler continuously during the 
1*CK anipUflcaTlon. Therefore, tiie reactions ans 
monitored in real time- The. output data is de- 
scribed and quantitative analysis of input target 
I )NA sequences 15 discussed below. 
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RESULTS 



PCR Product Derectlon in Time 

The goal vva« to develop a high-throughput, sen- 
sitive, and neruraic gene quantitation assay for 
use In monitoring llpul mediated therapeutic 
gene delivery. A plasmld encoding human factor 
Vlil gene sequence, pI>'8TM (sec. Methods), was 
used as a model therapeutic ge.ne. The assay usr< 
fluorescent Taqman methodology and an instru- 
ment capable of measuring fluorescence in real 
time (Alii Prism 7700 Sequence Del on or). 'I*he 
Taqnu*" reaction requires a hybridization jrrnhr 
Ial>clcd witli two different fluorescent dyes. One 
dye Is a reporter dy« (I ; AM), the other is quench- 
ing dye (TAMRA). When the proU: \s inlacl, fluo- 
icsccnt energy transfer occurs and the reporter 
dye fluorescent emission ia ubsorbed by the 
quenching dye (TAMRA). During Die extension 
phase of the PCR cycle, the .fluorcseeiii hybrid- 
probe is cleaved by the 5'-.T nucleolytic 
activity of thr. DNA polymerase. On cleavage of 
the probe, the reporter dye emission Is no longer 
transferred efficiently to the quenching dye, re 
sultiitK b> «n increase of the report or tlyti fluores- 
ce." t cniS-uiloii ttp^Ctro. VCIK primers und prubuw 
were de»igm*il ft>i Mil* human factor VI 1J se- 
quence and human act In gene, (a.t dt^eribfcd in 
Methods), Optimization reactions were per- 
formed to choose the appropriate probe and 
magnesium concentrations yielding the hiK lu 'st 
hit tm.ii I y of reporter fluorescent signal without 
sacrificing specificity. The InMrumcsnl u;;es a 
charge-coupled device (i.e., CCD camera) for 
measuring the fluorescent emission spectni from 
. r ,(K) t<> rfSO nm, Uach rc:il tube was monitored 
sinpientially for 25 m.suc with continuous moni- 
toring throughout tin: oni]>lificrttit>xi. Uftch tube 
won rr.-cxandrtcd every see. Computer soft- 
ware, was designed tocxainin* the fluorescent 1)1- 
tensity of both the reporter <iye (]>AM).and 
the quenching dye (TAMIIA). Ti*e Huore.sccrtt 
iriteDsity of the quenching dye, TAMRA, chimgvs 
very I It I If. over the course of the PCR ampllfK 
cation (datn not shown) . The.re.f ore, the Intensity 
of TAMIIA dye emission servwt as an internal 
rtlamJard with which to normulbw; the reporter 
dyi: (l : AM) emission variatJons. Il^e software cal- 
culates a vrthn: termed ^Kn (or AIUJ) using the. 
following equation: ARn - (lin 4 ) where 
Hn 4 • enilsifluii iulejishy of reporter/emission in- 
tensity of quencher al any siven lime In a reac 
rioti tube, and Phi r- emission intensility of re- 
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ponci/emlssiciil hilvmily "I quencher measured 
prior 10 I'CK iimpMicat.oii in that same reaction 
tube. l ; or the purpose of quantitation, the U.si 
three claw points (ARns) collected during the. ex- 
tension step for each I'CK cycle were annJy/ed. 
The mideolytic degradation of the. hyuruliy^tion. 
probe occurs during the extension phase or rut, 
and, tht:rvfore, reporter fluorescent umaMun in- 
creases during this time. 'Jiu: thjcv data point?* 
were averaged for cadi cycle and the uichh 
value fur each was plotted in an "aiuplMlcanon 
plot" shown In J'itfurc 1 A. The Alio mean value is 
pitted on the j>-axJs, and time, represented by 
cycle number, is plotted on t|jv*-axis. During the 
«arly evekw erf the VCA\ amplification, the ARn 



COPY 



value remains at base line When sufficient hy- 
bridist Ion probe has hocn cleaved by the Tm) 
]X)lymcriiste nucle.Afie activity, the intensity of ro. 
porU-r fluorescent emission iiif-'reaee*. Most POU 
umplifk-Mjons reach a plateau phone of reporter 
fluorescent omifision if the rcHulion Is carried not 
to high cycle uunibciN. The amjiKfiraflnn plot 19 
examined vuily in the reaction, at a point IhAl 
icj>ioscnts i\w log phase of pnxUid srnnnula* 
1km. This is done by assigning a« arbiUa/y 
ihrcshoJd thai is bused on the varhibil'tly of the 
bass-line data. Jn Figure 1 A, the Ihrttthold whs sc.! 
ui 10 standard deviations above, the mean of 
haw line enifcuum calculated from cy;;Ic:> 1 lo 1 5. 
Oner the threshold is chosen, the point at whtr.h 
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35 «o 

Cycle 

Ftoure 1 PCR product detection in real time {A) The Model 7700 >o!lware win coroinjci ^""^""a'SaI' 
S^the extension phase fluorescent emission data collected during the r>CR ampllflcauon. The standard de- 
Sonl "Jire,. the data points coHected from ft. base «ne ^f^™^^ ~ 
calculated 
Standard deviation 

DMA wmplcs amplified with fi-actm pnmers. (C) Input L ^ _ v 



jelermined from the data points collected from the base line of the ampin canon pin. <-i ™«= «« 
bv determining the poinl a which the fluorescence exceeds a threshold llm.l (usually 10 t,mes the 
Aviation S the base line). (B) Overlay ot amplification plots of serially (1 :2) diluted human genormc 
\t, a m«lified with fi-actin primers. (O Input DNA concentration of the samples plotted versus C T . All 
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the amplification plot crosses the thrcshold'is-elv 
fined as C,, C, is reported us Iho cycle number at 
this point. As will be demonstrated, thw CI, .value 
Is piediidve of the quantity of input hirge.t. 

Cj Values Provide a Quantitative Measurement* of 
Input Target Sequences 

Figure 1R shows amplication ploU of liwdirYrv- 
ent PGR amplifications overlaid. The amplica- 
tions were performed on a 1:2 serial dilution 
human genomic JWA. 'Jlic amplified target v*iu 
human p oetln. The amplification plot* xhift to 
the right (to higher threshold cycles) ns the. input 
target quantity is reduced. 'J1*>is is expected he*, 
eamu niHCtioriK with fewer starting eopies of the 
largct molecule require greater amplification to 
degrade enough probe to attain the Threshold 
fluorescence. An arbitrary threshold of 10 stan- 
dard deviations above the base line was used to 
determine the O r valuer. Figure 1C represents the 
C r valuesf plotted versus the sample dilution 
value, Each dilution was amplified in triplicate 
Pf:R Amplifications and plotted as mean values 
with error bars representing one standard devia* 
tion. The C-r values decrease linearly with increas- 
ing target quantity, Thus, C r values can be used 
as a quantitative measurement of Hie input target 
number. It should be noted that tin* amplifica- 
tion plot for the 15.6»ng sample shown in Figure 
Ifl does not reflect the same fluorescent rate of 
Increase exhibited by most of the other samples, 
'me 15.6-ng sample also achieves endpoint pla- 
teau at a lower fluorescent value than would he 
expected based on the input I>NA. This pJumimi* 
cnon has been observed occasionally with other 
samples (data not shown) and may be attribut- 
able to late, cycle inhibition; this hypothesis is 
still under investigation. It is important to note 
that the flattened slope and early plateau do not 
impact signlflcjHitJy the calculated C, value us 
demonstrated by the Hi on die lint* shown in 
Figure 1 C. All triplicate amplifications resulted in 
very similar Cr values — the standard deviation 
did not exceed 0.5 for any dilution. This experi- 
ment contain** a > 3 00,000-fold range of Input tar- 
get molecules. Using C v values for quantitation 
permits a much larger assay range than directly 
using total fluorescent emission intensity for 
quantitation. The linear range. oi iluoresccnt in- 
tensity measurement of the AIM I'rhm 7700 &e- 
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miMits over n very htrge T;mj>t» nf rplallvr* starting 
tar&t»t quantities. 

Sample Preparation Validation 

Several parameters influence the efllcU ; nry of 
PCR amplification: magnesium and sail conceu : 
nations, reaction conditions (i.e., time and linn- 
poraluro), PCH target size and composition, 
primer sequences, and sample purity. AiJ of Tlie 
.above (actors are common to a single VCR assay, 
except sample to sample purity, in an effort to 
validate the. method of sample preparation for 
the iacior VJJ1 assay, PCK amplification reprodnc. 
ihility and olflclcncy oi 10 replicate sample 
prof mirations were, examined. After genomic DNA. 
was prepared from the TO rfcplicate samples, the 
DNA wasquantiiaicd by ultraviolet spectroscopy. 
Amplifications were performed analyzing p-nciln 
gene, content in 100 and 25 ng of total genomic 
DNA. Each FCK amplification was performed in 
triplicate. Comparison of C r values for each trip. 
Heate sample show minimal variation based on 
standard deviation and coefl'Ieient of variance 
(Tabic 1). l>)ercforc t each oi the triplicate PCK 
ainj)lifications was highly reproducible, demon- 
strating that re.al time PCK using this Instrumcn- 
tnlion introduces minimal variation Into the 
quantitative. J'CK analysis. Cmimarisoii of the 
mean CV, values of ihe. 10 replicate sample prepa- 
rations also showed minimal variability, indicat- 
ing that each sample preparation yielded similar 
results for p-nclin gene quantity. The highest Cy 
difference between any of rhe samples was 0.85 
and 0.73 for the 100 and 25 ng samples, respec- 
tively. Additiona/ly, the amplification of t:ai:h 
sample, exhibited an equivalent rate of fluore.s> 
cent emission intensity change per amount of 
DNA largct analysed as indicaied by similar 
slopes derived from the sample dilutions (Fig. 2). 
Any sample containing an excess of a i'CU inhibi- 
tor would exhibit a greater measured (3-actJn C r 
value for a given quantity of DNA. In addition, 
the inhibitor would be diluted along with the 
sample in the dilution analysis {Hg. 2), altering 
the expected C r value change, Raeh sample am- 
plification yielded a similar result in the analysis, 
demonstrating that this method of sample prepa- 
ration is highly reproducible, with regard to 
sample purity. 

Quantitative Analvsis of a Plasmid After 

7ncn no/ wj «c:m 7nn7/cn/7T 
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Reproducibility of Sample Preparation Method 



100 ng 25 ng 



standard standard 
m£an deviation CV C T mean deviation CV 



18.24 20.48 

18.23 20.55 

18.33 13.27 0.0$ 0.32 20,5 20.51 0.03 0,17 

18.33 20.61 

18.35 20.59 

18.44 0.0(5 03? 70.41 0.11 0.54 
18,3 20.54 

18.3 20.6 

18.42 18.34 0.07 0.36 20.49 20.54 0.06 0.26 

18.15 20.48 

18.23 20.44 

18.32 18.23 0.08 0.46 20.38 20.43 0.05 0.26 

T8.4 20.68 

18.38 20.87 

18.46 18.42 0.04 0.23 20.63 20.73 0.13 0.61 

18.54 21.09 

18,67 21.04 

19 18.74 0.24 1.26 21.04 21.06 0.03 0.15 

18.2B 20.67 

18.36 20.73 

18.52 18.39 0.12 0.66 20.65 20.66 0.04 0.2 

18.45 20.98 
18.7 20.84 

18.73 18.63 0.16 0.83 20.75 20.86 0,12 0.57 

18.18 20.46 

18.34 20.54 

18.26 18.29 0.1 0.55 20.48 20.51 0.07 0.32 

18.42 20.79 

18.57 20.78 

1 B.66 18.55 0.12 0.65 20.62 20.73 0.1 0.46 

(1 10) 18.42 0.17 0.90 20.66 0.19 0.94 



Table 1 . 



Samplo 
no. 

1 
2 
3 
4 
5 
6 
7 
8 
9 
10 



Mean 



(or containing a partial cDNA for human factor 
vni f pi ; 8TM. A scries of tr.-» infections was sot 
up using a decreasing amouni of the pla$mitT(40, 
4, 0.5, and 0.1 (xg). Twr.my-rour hours past- 
trunafei-tinn, total DNA wn« purified from each 
flask uf erlh. p-Acliu gene \.ju<tuLiLy wn^ chox-ji us 
<i value for norma li/at it *n or j*ei jomir. DNA con- 
ccnrraUoii frumeaeh sample. In this cxpeiimciW, 
(i-actin gene content should' remain constant 
relative to rural genomic UNA. Figured shows the 
result of (he p-actln DNA measurement p00 ng 
total DNA determined by ultraviolet spectros- 
copy) of each sample. Kaeh sample was analyzed 
in triplicate and the mean |3-actin C r values of 
the triplicates were plotted (error bars represent 



iH'tw^uii any iwo sample* moam was 0.<>S C n Ten 
nanograms of total DNA uf each sample were also 
examined fur p-aciln. llic results again showed 
that very similar amounts of genomic 1>NA wore 
present; tin: maximum mean \s actio O, value 
difference wa.s 1.0. A$ I'igure 3 shows, the rate of 
P-acrtln C r change Ixriwccn the 100 and 10-ng 
sajnple;* was simitar (slope vaJucs range* hwtwoirn 
3.56 ana -3.45), Th\s verifies again that the 
method of .sample preparation yields samples of 
identical PCR integrity (j.<u ( no sample contained 
an excessive amount of a PGR inhibitor). TTnw. 
ever, these results indicate thui cacl) sample* con- 
tained slignt diffciei7C.es in the actual amount of 
genomic ON A analysed. Determination of actual 
genomic ONA ^anecnUtttion wo$ accomplished 
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Figure 2 Somple preparation purity. 1 he re plicate 
camples shown in Table 1 wore also amplified In 
tripicate using 25 ng of each DMA sample. The fig* 
utK shows the input DNA concentration (TOO and 
ncj) vs. C, In ih*» 1irjn«\ thp 100 nnd 75 ng 
points for each sample are connected by a line. 



by plotting the mean P-actin C t value obtained 
for uach 100 ll£ saiUplv vmi a fUiciln standard 
curve (shown in J'Sg. 4Q. The actual ficnumlc 
ONA concent rsi*l«"> of each sutnpi*:, «, was ob 
talncd lay extrapolation tt> tin- 

Figure 4 A shows the .measured (l.u., nu^- 
norniall7.«ti) qumititie-s of factor VIJJ plnainid 
ONA (pJtn"M) from each of tlu: four transient cell 
Inunctions. Each reaction contained ]00 ng of 
loml sample DNA {&$ determined by UV spectro* 
copy). VuxcU sample vviis uualyzcd in triplicate 



25- 



21 



20 



V -27,73 J ^IKVrll. 1 

y »w».7r t -s.sax w»i 



pp8TM transtooto d 

• 0 J MO 
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Figure 3 Analysis of tianslectcd cell DNA quantity 
and purity. I lie DNA preparations of tin-* four 293 
cell transections (40, 4, 0.5, and 0,1 jmg of pF8TM) 
were analyzed for the p-actln gent** 100 and 10 ng 
(determined by ultraviolet spectroscopy) of each 
sample were amplified in triplicate. For each 
amount of pF8TM that was transferred, the p-actln 
C T values are plotted versus the total Input DNA 



PClU amplifications. As shown, pI*'8TM purified 
f fUiir Jbc 29H cells decreases (mean C, values in- 
ew.tsi'tj with decreasing amounts of plaamld 
itruiisU'iAcd. The mean C A values obtained fur 
pFbTM in'Hgurc 4A were plotted on u standard 
curve comprised of svilally diluted pKHTM, 
shown .in figure 4R. The quantity oJ ph'XTM, //, 
found in ca*:h of the four transections w:\s de- 
termined by extrapolation to the* jt uxh of t)ie 
standard curve In 1'igurc 4H. Thttsc uncorrected 
values, b, for ij>HTM were iioruiMllvAiri to del er- 
mine the actual amount of pl'8'lM found per 100 
"g of genomic ONA by using the equation:. 

/> X 100 ng actual pl-B'Hvl copies oer 
T 100 ng of genomic DNA 

where a - actual- genomic DNA in u satnpie and 
b i- pl : H*l*M copies from the standard curve. '11 ic 
normoJi^cd ^vantity of p(WM per 100 ng of ge- 
nomic DNA for each of the four 1 ransfectlon.s Is 
shown in higure 4 J J. 'Hicm: rout Is ,%how ill at the 
quantity of factor vill plasuiiO associated wiiii 
tnc cells, 21 lir uftcr iruiusfvciKin. 0i:i.u:.ise.'» 
with OeercaslUK pittsmiiJ uini.«iuiatjon u.scxt in 
the tran^fedion. "Hit: quantity of pJ'tJTM nssoei- 
ateu with 293 cells, after trunsfectlon with 40 ixg 
of pKisjnid, was 35 pgptr 100 ng ^vtioiiiIc DNA. 
This results in -520 plasmid copies per cell. 



DISCUSSION 

Wo have described a new method for qunntitni- 
infc gene copy numbers using rcaMlmc anulysts 
of PCX amplifications. Reai-tlmc i J CK is compat- 
ible with cither of the two PCK (K7-PCR) ap- 
proachei: (1) quantitative con»p cl ^' v ^-' where An 
intcuiid wmpcliior for each target sequence is 
used for nomialiKaUon (data not shown) or (2) 
quantitative comparative 3*CK ush/y ti lujuinili^ci- 
tlon gene contained within the sample (i.e., (3-nc- 
tin) or a "housekeeping" gene, for RT-J'CK. If 
e.quai amounU of nucleic odd arc analyzed for 
each sample and if lllC ampltflcatiun effk ir.ru.y 
before quantitative* analysis i> identical for each 
sample, the iTirernai cuiiluil (nwjm.di^Uion uene 
or competitor) shouid give eqiud »ignals for al) 
samples. 

The real-time PCK method offers several ad- 
vantages over the other two methods currently 
employed (see the Introduction). l : irsl, the real- 
time PCK method is performed in a closed-tube 
system and requires no post -PCK manipulation 

t r» /■» <? r»rti o^o w.T nn *r»T 7nn? /on /7T 
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Figure A Cliiantltaiivo analytic of pFSTM in transfccted cclb. (A) Arnounl of 



plasmid DNiA used for I he trunsfealon plotted against the mean C, vwfue deter- 
mmfld for pffiTM remaining £>| br alter (nantfection. (0,0 Standard curves of 
pFRTM and P-actin, respectively. pfQTM DNA <0) and genomic. DNA (Q were 
diluted *Arlally 1 ;5> before amplification with the oppropn'nte primers. The fi-actin 
standard curve wav used 10 normalise the results of A lo 1 00 ny of genomic DNA. 
(D) The amount of pVSTM present pv.r 100 ng of genomic DNA. 



of simple. Therefore, |h«« potential for fCR con- 
lamination in the Jabi>r;ii<»ry is reduced because 
amplified products can ln» analysed and disponed 
of without opening the r tutu ion tubes. Second, 
(his method suppoiU die umi of a iiurm;i1iy.<ith>n 
gene (Lc., p-octin) for quantitative PGR or house- 
keeping genes for quantitative RT-PCK controls, 
Analysis is performed in real time during the Jog 
phase of product accumulation. Analysis during 
lo* phase permits many different genes (ova a 
wide input target range) to be analy^rd simulta- 
neously, without concern of reaching reaction 
plateau at different cycles, This will make mulli- 
gan e analysis assays much caMei U/ develop, be- 
cause Individual internal i.ui npe.lUoi> will mil be 
needed for each gene under analysis. TJdrd, 
sample throughput will iut-Jtrasc 0 r ur iijs Uctflly 
with the new method because, there is no post- 
PCK pmccnslng time. Additionally, winking 1 ii a 
°6-we.il format h highly compatible with auto- 
million technology, 

The real-time PGR method is highly repro. 
ducible. Replicate amplifications can be analyzed 



for ffach sample minimizing jxMcnUal error. The. 
sysuiin allows* u>r a very large assay dynamic 
runge (approaching 1,000,000 -fold starting Uii- 
get). Uaiiig a .standard curve for the. target oi in- 
terest, relative copy number values can be deter- 
mined for any unknuvvji * ample, hi u orescent 
threshold values, C 1V conrJair. linearly with rela- 
tive DNA copy numbers. Ileal time quantitative 
K'J-rCK methodology (d'lbsoji et al v this l.«uft) 
ha.i a ho been developed. Finally, real time quan- 
titative I'CU methodology can be used tu develop 
high-throughput screening assays for a variety of 
applications [quantitative gene c*pica»iun (KT- 
rCl^), ftcne copy nasaya (Mcr2, IHV^ etc.)* geno- 
typlng (knockout mouse analysis), and Jnimuno- 

rcuj. 

Rr..il-ti)ncs PCtt :r»ay jiImj he j Perform rtl using 
intercalating dyes (Higtichi ct al. such its 

e.iJddium bromide. The fluorogenic probe, 
method offers a major advantage over inter- 
calating dyes— greater specificity (i.e., primer 
dimvrs and rjoTjspcrJHc PCR products are. not de- 
tected). 
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METHODS 

Generation of <t Plasmtd Containing a Purtlol 
cDN A Tor Human Factor YI1I 

Total RNA harvested tUNAwl H- from Tel Test, * ,,c -r 
r*nc»65wood, TX) from evJI> ii-nafecled with a factor VUl 
expression veetor, pc:iS2.tk?.Sl J (Katun ei al. WHO; Gor. 
ninn c! al. 19t>0). A factor VIII partial ellNA wpiwuv WttS 
^..utmlod by in* K:lt'|<iw»oAnip V'J. iTlh ItNA ITJI Kit 

(pan NttOK-ur/y, i'h Appiun) tiiosyucms, Vosivi <'hy, <.;a)J 

using the Hru jjiimurs PHfor mimI 1**Rjx-v (prinii'f srerucurrt 
aire shown below). 'On* ampHcon was reaniplihVd uslnfi 
modified I'Ufo* and i*rcv primers (tipix'iuled with kuwin 
and HwdlJJ restriction site sequences »i the h' end; and 
Clonal into jXiKM- 3Z (]Vomc$»u Corp., MuiIlmoi), WI). The 
resulting clone, pPSTM, was uwil lor transient transection 
oJ 293 cdlA. 



Amplification of Target DNA anil Detection of 
Amplicon Factor VIII Ptasmid DNA 

(pHJTM) was *nnpltfl«il with tliv ptimvi* I**8for S'-OCCI- 

<rr(K;(^\AUAu:jtjAtxiicnv.-3' and wrev .v-AAAc:crr- 

IJAOGCrrGGATCUjTAOCi-.V.Tlie rvticlUm piudiivrd ti 49.% 
np rt'zK product. The forwiird primer ww denned to icl* 
oynl/.e a unique m'i picnic- fwurid In the .V untranslated 
region of the. paiciil uC152.tk25l> plewiml ntitl thnicforc 
dues not iVL-wj^/jUu iiitd amplify ihc human factor V1 11 
gem*, l'riinnrft woro chosen with tho avsivtaitcr* of I he com- 
puter program Oligo l«0 (Nulimiid lluweicnees, Inc.* Ply. 
mouth, MN). The human p-acttn gpnc was amplified w'Uh 
the primers (J-iK-iin forward primer S ' TCAOCCACA< 7VtVV 
GCCCATO'AOC'JA-.V and fi-uclijj icverse piiwcr .S'.CAC. 
C0GAACCX:f:i*<:ATr(U:c^A , j'Cc1-3'. The reaction pro- 
ciueeo a 2V5-np i«c:u product. 

Amplification reactions (SO fJ) contained a DNA 
sample, Klx I'CR liuffur II (S 200 pkt dATP, dCTT, 
dGTP, and 400 p.M rillTP, A mxi Mg<:i ?( l.^S Unlis Ampll 
7«</ r;NA pojymciasc, 0.5 unit AmpKrasc uracil rV-fiiy- 
ciMyluM' <UH0), fit) pmolvof each fftctoi VUl j^liix-i, wnd 15 
(.xiiol^oC ituoh |< Actln |)il»nt»r. 'Hui leai'tlum nl«j t:t>nlatncd 
OIK Of the following (lt*HTtlon prnlM*K (MM J iim rnrli)i 

i'Bpn.»ii<- A'(i'AW)Ac:crJvri , c:ru(:c:T<;frn , (rrrr(:rtn'- 

GCCTT(TAWRA)p 3' <iud p-nttin probe 5 f (TAM)ATCC<:c;- 
X(TAMKA)CCCCr:ATCt"OATtV-.T where p indicates 
phosphoryi/Hirtii /ind X hidlrotc* a linker arm nucleotide. 
Rene Lion luU^ w«-tv Mit:ftiAit\p Optical 'lulx-s (part num- 
ber NK01 Pcricin Ulmux) tJiat wore frusUKl tut JVrfctn 
Hlmcr) to prevriil lijjhl iVom reflecting. Tube cop* were 
si mi 1m* i" Micro A nip c;np» but specially designed to pre- 
vent light sc\jIUt'"J(. All <H IH;R Cami/«wii)u1>U>a wcro sii>w 
l^l;«d t>y PK Applied lljocypitfno ClUy, CX) except 
ihr faclt>r VIU primers, which wen- sy mhc^l/eil at Cencn 
lech, Inc. (Stuit)» f»i*n F«inc!soo ( CA). Prohev wm* dc-sJ^nvcl 
using the OI);;r.> 4.0 50 f I wore, folk'wing giililfllitos ku^- 
jjCMwi in tiic Modd 7700 .Sequence Puuvtiw hmuuiicfil 
manual. Urlcfly, prubc 'J' OT he al Jeaat 5 U (J higher 
mail OH' ^tuu'ulliiX teuipcj/iiiirc u.vrd durlnj; ihrrmul cy- 
rhiigj primers Shc5\dil not fviin sUUW duplexed with the 
prnbe. 

The iluTiuol cycling condition* In eluded 2 juId at 
5trC and 10 n»in at 95"C 'Utejinal cycling prorrrded with 



nactioiiii wore perfonuod in tho Modrl 77(H) Sequence W- 
te«K>r (rt Applted Ulosyylvuiv), wlilrh contains » Ccn» - 
Anip l»< :K Sybtvni P«X>. kcactlon «xuidition« w<-rr- pro. 
fcrumiinxl un .1 I'wwvr Mncintt^h V100 (Apple f.Xunpntor, 
Santa Cloru, tiA) linked dirvcily to the Model WOO .V- 
tjucnev l>itl*clor. Ana'yvU *>f data w»« alw.i \u*r<^tmtH\ on 
iliv M»»i lntfj«h compviter. CVtl Inn Ion and wn^lyKls toft ware 
wu% devclo|x»d ?*t in-: Applied nicxy*imHs. 



Transection of Cells with Factor VIII Construct 

J.Viur T17.S Oasks of 293 cells (ATCX: CM. J57H), * human 
feiol kidney stHipention cell line, wvrc b^owh ft0% con- 
lliicm-y and transected pIVIM Cells were grown in the 
following medim St)% HAM'S ¥)2 without GHT, lowi 
glucose JIuJIkuhni's modified Kaxloinvdiiiiu (])MKM) wltJi- 
cm glycnni: witli sodium bicarbonate, 10% leial Ikivuic 
scrum, 2 him L-j;luidinii)c, And 1% penitillin-slrcptomy-' 
win. The medio waa diart^cd 30 mln l>eA»«* the transfce 
lion. pl : UTM DNA amounts of 40, 4, OA, .ind 0.1 p,»; were 
added it) ml of fl sohitJon containing O.IZS m Ci»0J ? ; 
and 1 x IH'J'US. The four mixtures wore left at room tein- 

[.K-.n»t»jn- (<it TO mill and then iidded Hnipwlsc U> the cells. 
Tliv n«*k> wvit* Wii-oUiled al 37°C and 5»% ClO. for 24 hr, 
washed wiUj l'ltS, i*»d r^Mspe.ndcd In I'liS. The 
jft-iuK^I celb were divided into ediquois und wad CV- 
tr»cted luinicdlutely usiiij; Ihv QIAunii* RUwtd Kit (Qiapen. 
Ch«t,vrvortli, <.^), DNA wos cluted Into 200 uf 30 i«m 
Trls-IICl at pll tt.0, 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-1 and WISP-2, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISP-3, these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (i) C57MG ceils infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracylihe repressible promoter, and («) Wnt-1 transgenic 
mice. The WISP-1 gene was localized to human chromosome 
8q24.1-8q24 j. WISP-1 genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to > 30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISPS 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to > 40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to > 30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 

Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsh) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-3/3 (GSK-3/3) resulting in an increase in 
0-catenin levels. Stabilized /3-catenin interacts with the tran- 
scription factor TCF/Lefl, forming a complex that appears in 
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the nucleus and- binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (APC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
/3-catenin levels (9). APC is phosphorylated by GSK-30, binds 
to 0-catenin, and facilitates its degradation. Mutations in 
either APC or j3-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)-/3 superfamily, and the 
homeobox genes, engrailed, goosecoid, twin (Xtwn\ and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-1 
and WISP-2, and a third related gene, WISPS. The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov, a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA 
Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor; CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C module. 
Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AF100777, 
AF100778, AF100779, AF100780, and AF100781). 
tTo whom reprint requests should be addressed, e-mail: diane@gene. 
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cDNA was synthesized froni 2 /xg of poIy(A)* RNA isolated 
from the C57MG/WnM cell line and driver cDNA from 2 jig 
of poly(A)" RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP-] were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
coding full-length mouse and human WISP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WJSP-3 were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 /iM of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33 P-labeIed sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP- 1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology [Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM ? HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2<a«) w here ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
d-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The WISP- specific signal was 
normalized to that of the glyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 

RESULTS 

Isolation of WISP-I and WISP-2 by SSH. To identify Wnt- 
1-inducible genes, we used the technique of SSH using the 
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mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-I and WISP-2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. 1 A and B). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on /3-catenin levels (13, 14). Expression of WISP-I was 
up-regulated approximately 3-fold in the C57MG/Wnt-1 cell 
line and WISP-2 by approximately 5 -fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-I gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-I mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-I were isolated and the 
sequence compared with mouse WISP-I . The cDNA sequences 
of mouse and human WISP-I were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of ^40,000 (M T 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 2A). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of ~ 27,000 (M T 27 K) (Fig. IB). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-linked glycosylation sites, and mouse WISP-2 has one at 

C57MG 
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Fig. 1. WISP-I and WISP-2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-1 {A) and 
WISP-2 (B) expression in C57MG, C57MG/WM-1, and C57MG/ 
Wnt-4 cells. Poly(A) + RNA (2 ^g) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse WW- /-specific probe 
(amino acids 278-300) or a 190-bp WISP- 2- specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human /3-actin probe. 
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Fic. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-1 {A) and mouse and human WISP-2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-termina! (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-1. 

Identification of WISP-3. To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
W1SP-3 cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 3/4). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1, WISP-2, and WISP-3 are novel sequences; 
however, mouse WISP-1 is the same as the recently identified 
Elml gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-T (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov, CTGF is a chemotactic and mitogenic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-/3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21). The N-terminal domain, which includes the First 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor (IGF)- 
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Fig. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (£) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 



14720 Cell Biology, Medical Sciences: Pennica et ai 



Proc Natl Acad. ScL USA 95 (1998) 



analysis on adult and fetal multiple tissue cDNA panels. 
W1SP-1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISPS was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISPS 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-1 and WISP-2. Expression of 
WISP-1 and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WlSP-1 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP-1 expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-1, WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 E-H). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 
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Fig. 4. (A, C, E f and G) Representative hematoxylin/eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WJSP-1 expression are shown in B and 
D. The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power {A and £), 
expression of WISP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and D), and tumor cells are negative. 
Focal expression of WISP-1, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (E and F), expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and H). 



the predominant cell type expressing WISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISPS mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISPS is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression of WISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
. assessed by quantitative PCR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISPS was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 
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Fig. 5. Amplification of WISP-1 genomic DNA in colon cancer cell 
lines. {A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (B) Southern blots containing genomic DNA (10 jig) 
digested with EcoRl (WISP-1) or Xba] (c-myc) were hybridized with 
a 100-bp human WISP-1 probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 
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Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-1, WISP-3 RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fig, 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient. 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 



mucosa. The amount of overexpression of WISP-3 ranged from 
4- to >40-fold. 



DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP-1, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest- 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., /3-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-l-transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through j3-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-/3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin a v ft serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
- tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-j31, which is the stimulus for 
stromal proliferation (34). TGF-/31 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Writ-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-I gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP-2 was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon, 

A recent manuscript on rCop-I, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and /3-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic /3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. , 
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methods. Peptides AENK or AEQKwere dissolved in water, made isotonic with 
NaCl and diluted into RPM1 growth medium. T-cell-proliferation assays were 
done essentially as described 20,21 . Briefly, after antigen pulsing (30u.gmr' 
TTCF) with tetrapeptides {l-2mgmr'), PBMCs or EBV-B cells were 
washed in PBS and fixed for 45 s in 0.05% glu tar aldehyde. Glycine was added 
to a final concentration of 0.1 M and the cells were washed five times in RPMI 
1640 medium containing 1% FCS before co-culture with T-cell clones in 
round-bottom 96-well microtitre plates. After 48 h, the cultures were pulsed 
with 1 u-Ci of 3 H-thymidine and harvested for scintillation counting 16 h later. 
Predigestion of native TTCF was done by incubating 200 n,g TTCF with 0.25 u.g 
pig kidney legumain in 500 u.1 50 mM citrate buffer, pH 5.5, for 1 h at 37 °C. 
Glycopeptide digestions. The peptides HIDMEEDI, HlDN{N-gIucosamine) 
EEDI and HIDNESDI, which are based on the TTCF sequence, and 
QQQHLFGSNVTDCSGNFCLFR(KKK), which is based on human transferrin, 
were obtained by custom synthesis. The three C-terminal lysine residues were 
added to the natural sequence to aid solubility. The transferrin glycopeptide 
QQQHLFGSNVTDCSGNFCLFR was prepared by tryptic (Promega) digestion 
of 5 mg reduced, carboxy-methylated human transferrin followed by 
concanavalin A chromatography". -Glycopeptides corresponding to residues 
622-642 and 421-452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N-terminal sequencing. The lyophilized transferrin- 
derived peptides were redissolved in 50 mM sodium acetate, pH 5.5, 10 mM 
dithiothreitol, 20% methanol. Digestions were performed for 3 h at 30 °C with 
5-50 mUml - ' pig kidney legumain or B-cell AEP. Products were analysed by 
HPLC or MALDI-TOF mass spectrometry using a matrix of lOmgml" 1 oc- 
cyanocinnamic acid in 50% acetonitrile/0.1% TFA and a PerSeptive Biosystems 
Elite STR mass spectrometer set to linear or reflector mode. Internal standar- 
dization was obtained with a matrix ion of 568.13 mass units. 
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Fas ligand (FasL) is produced by activated T cells and natural 
killer cells and it induces apoptosis (programmed cell death) in 
target cells through the death receptor Fas/Apol/CD95 (ref. 1). 
One important role of FasL and Fas is to mediate imrnune- 
cytotoxic killing of cells that are ^potentially harmful to the 
organism, such as virus-infected or tumour cells 1 . Here we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), that binds to FasL and inhibits FasL-induced 
apoptosis. The DcR3 gene was amplified in about half of 35 
primary lung and colon tumours studied, and DcR3 messenger 
RNA was expressed in malignant tissue. Thus, certain tumours 
may escape FasL-dependent immune-cytotoxic attack by expres- 
sing a decoy receptor that blocks FasL. 

By searching expressed sequence tag (EST) databases, we identi- 
fied a set of related ESTs that showed homology to the tumour 
necrosis factor (TNF) receptor (TNFR) gene superfamily 2 . Using 
the overlapping sequence, we isolated a previously unknown full- 
length complementary DNA from human fetal lung. We named the 
protein encoded by this cDNA decoy receptor 3 (DcR3). The cDNA 
encodes a 300-amino-acid polypeptide that resembles members of 
the TNFR family (Fig. la): the amino terminus contains a leader 
sequence, which is followed by four tandem cysteine-rich domains 
(CRDs). Like one other TNFR homologue, osteoprotegerin (OPG) 3 , 
DcR3 lacks an apparent transmembrane sequence, which indicates 
that it may be a secreted, rather than a membrane-asscociated, 
molecule. We expressed a recombinant, histidine-tagged form of 
DcR3 in mammalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polyacrylamide gels as a protein of 
relative molecular mass 35,000 (data not shown). DcR3 shares 
sequence identity in particular with OPG (31%) and TNFR2 
(29%), and has relatively less homology with Fas (17%). All of 
the cysteines in the four CRDs of DcR3 and OPG are conserved; 
however, the carboxy- terminal portion of DcR3 is 101 residues 
shorter. 

We analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. lb). We detected a predominant 1.2-kilobase 
transcript in fetal lung, brain, and liver, and in adult spleen, colon 
and lung. In addition, we observed relatively high DcR3 mRNA 
expression in the human colon carcinoma cell line SW480. 

To investigate potential ligand interactions of DcR3, we generated 
a recombinant, Fc-tagged DcR3 protein. We tested binding of 
DcR3-Fc to human 293 cells transfected. with individual TNF- 
family ligands, which are expressed as type 2 transmembrane 
proteins (these transmembrane proteins have their N termini in 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL 4 (Fig. 2a), but not to cells transfected with 
TNF 5 , Apo2L/TRAIL 6 * 7 , Apo3L/TWEAK 8,9 , or OPGL/TRANCE/ 
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RANKL 10 "' 2 (data not shown). DcR3-Fc immunoprecipitated shed 
FasL from FasL-transfected 293 cells (Fig! 2b) and purified soluble 
FasL (Fig. 2c), as did the Fc-tagged ectodomain of Fas but not 
TNFR1. Gel- filtration chromatography showed that DcR3-Fc and 
soluble FasL formed a stable complex (Fig. 2d). Equilibrium 
analysis indicated that DcR3-Fc and Fas-Fc bound to soluble 
FasL with a comparable affinity (K d = 0.8 ± 0.2 and 
l.l±0.1nM, respectively; Fig. 2e), and that DcR3-Fc could 
block nearly all of the binding of soluble FasL to Fas-Fc (Fig. 2e, 
inset). Thus, DcR3 competes with Fas for binding to FasL. 

To determine whether binding of DcR3 inhibits FasL activity, we 
tested the effect of DcR3-Fc on apoptosis induction by soluble 
FasL in Jurkat T leukaemia cells, which express Fas (Fig. 3a). DcR3- 
Fc and Fas-Fc blocked soluble-FasL-induced apoptosis in a 
similar dose-dependent manner, with half-maximal inhibition at 
—0.1 n-gml" 1 . Time-course analysis showed that the inhibition did 
not merely delay cell death, but rather persisted for at least 24 hours 
(Fig. 3b). We also tested the effect of DcR3-Fc on activation- 
induced cell death (AICD) of mature T lymphocytes, a FasL- 
dependent process 1 . Consistent with previous results 13 , activation 
of interleukin-2-stimulated CD4-positive T cells with anti-CD3 
antibody increased the level of apoptosis twofold, and Fas-Fc 
blocked this effect substantially (Fig. 3c); DcR3-Fc blocked the 
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Figure 1 Primary structure and expression of human DcR3. a, Alignment of the 
amino-acid sequences of DcR3 and of osteoprotegerin (OPG); the C-terminal 101 
residues of OPG are not shown. The putative signal cleavage site (arrow), the 
cysteine-rich domains (CRD 1-4), and the AMinked glycosylation site (asterisk) are 
shown, b. Expression of DcR3 mRNA. Northern hybridization analysis was done 
using the DcR3 cDNA as a probe and blots of poly(A)* RNA (Clontech) from 
human fetal and adult tissues or cancer ceil lines. PBL, peripheral blood 
lymphocyte. 



induction of apoptosis to a similar extent. Thus, DcR3 binding 
blocks apoptosis induction by FasL. 

FasL-induced apoptosis is important in elimination of virus- 
infected cells and cancer cells by natural killer cells and cytotoxic T 
lymphocytes; an alternative mechanism involves perforin and 
granzymes'' 14 " 1 ". Peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Fig. 3d); DcR3-Fc 
and Fas-Fc each reduced killing of target cells from —65% to 
— 30%, with half-maximal inhibition at —1 jxgml" 1 ; the residual 
killing was probably mediated by the perforin/granzyme pathway. 
Thus, DcR3 binding blocks FasL-dependent natural killer cell 
activity. Higher DcR3-Fc and Fas-Fc concentrations were required 
to block natural killer cell activity compared with those required to 
block soluble FasL activity, which is consistent with the greater 
potency of membrane-associated FasL compared with soluble 
FasL 17 . 

Given the role of immune-cytotoxic cells in elimination of 
tumour cells and the fact that DcR3 can act as an inhibitor of 
FasL, we proposed that DcR3 expression might contribute to the 
ability of some tumours to escape immune-cytotoxic attack. As 
genomic amplification frequently contributes to tumorigenesis, we 
investigated whether the DcR3 gene is amplified in cancer. We 
analysed DcR3 gene-copy number by quantitative polymerase chain 
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Figure 2 Interaction of DcR3 with FasL. a, 293 cells were transfected with pRK5 
vector (top) or with pRK5 encoding full-length FasL (bottom), incubated with 
DcR3-Fc (solid line, shaded area). TNFR1-Fc (dotted line) or buffer control 
(dashed line) (the dashed and dotted lines overlap), and analysed for binding by 
FACS. Statistical analysis showed a significant difference {P < 0.001) between the 
binding of DcR3-Fc to ceils transfected with FasL or pRK5. PE. phycoeryth re- 
labelled celts, b. 293 cells were transfected as in a and metabolically labelled, and 
cell supernatants were immunoprecipitated with Fc-tagged TNFRt, DcR3 or Fas. 
c, Purified soluble FasL (sFasL) was immunoprecipitated with TNFR1-Fc, DcR3- 
Fc or Fas-Fc and visualized by immunoblot with anti-FasL antibody. sFasL was 
loaded directly for comparison in the right-hand lane, d, Flag-tagged sFasL was 
incubated with DcR3-Fc or with buffer and resolved by gel filtration; column 
fractions were analysed in an assay that detects complexes containing DcR3-Fc 
and sFasL-Flag. e, Equilibrium binding of DcR3-Fc or Fas-Fc to sFasL-Flag. 
Inset, competition of DcR3-Fc with Fas-Fc for binding to sFasL-Flag. 
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reaction (PCR) tfl in genomic DNA from 35 primary lung and colon 
tumours, relative to pooled genomic DNA from peripheral blood 
leukocytes (PBLs) of 10 healthy donors. Eight of 18 lung tumours 
and 9 of 17 colon tumours showed DcR3 gene amplification, 
ranging from 2- to 18-fold (Fig. 4a, b). To confirm this result, we 
analysed the colon tumour DNAs with three more, independent sets 
of DcR3 -based PCR primers and probes; we observed nearly the 
same amplification (data not shown). 

We then analysed DcR3 mRNA expression in primary tumour 
tissue sections by in situ hybridization. We detected DcR3 expres- 
sion in 6 out of 1 5 lung rumours, 2 out of 2 colon tumours, 2 out of 5 
breast tumours, and 1 out of 1 gastric tumour (data not shown). A 
section through a squamous-cell carcinoma of the lung is shown in 
Fig. 4c. DcR3 mRNA was localized to infiltrating malignant epithe- 
lium, but was essentially absent from adjacent stroma, indicating 
tumour-specific expression. Although the individual tumour speci- 
mens that we analysed for mRNA expression and gene amplification 
were different, the in situ hybridization results are consistent with 
the finding that the DcR3 gene: is amplified frequently in tumours. 
SW480 colon carcinoma cells, which showed abundant DcR3 
mRNA expression (Fig. lb), also had marked DcR3 gene amplifica- 
tion, as shown by quantitative PCR (fourfold) and by Southern blot 
hybridization (fivefold) (data not shown). 

If DcR3 amplification in cancer is functionally relevant, then 
DcR3 should be amplified more than neighbouring genomic 
regions that are not important for tumour survival. To test this, 



we mapped the human DcR3 gene by radiation-hybrid analysis; 
DcR3 showed linkage to marker AFM218xe7 (T 160), which maps to 
chromosome position 20ql3. Next, we isolated from a bacterial 
artificial chromosome (BAC) library a human genomic clone that 
carries DcR3, and sequenced the ends of the cloned insert. We then 
determined, from the nine colon tumours that showed twofold or 
greater amplification of DcR3, the copy number of the DcR3- 
flanking sequences (reverse and forward) from the BAC, and of 
seven genomic markers that span chromosome 20 (Fig. 4d). The 
DcR3 -linked reverse marker showed an average amplification of 
roughly threefold, slightly less than the approximately fourfold 
amplification of DcR3; the other markers showed little or no 
amplification. These data indicate that DcR3 may be at the 'epi- 
centre* of a distal chromosome 20 region that is amplified in colon 
cancer, consistent with the possibility that DcR3 amplification 
promotes tumour survival. 

Our results show that DcR3 binds specifically to FasL and inhibits 
FasL activity. We did not detect DcR3 binding to several other TNF- 
ligand- family members; however, this does not rule out the possi- 
bility that DcR3 interacts with other ligands, as do some other 
TNFR family members, including OPG 2 * 19 . 

FasL is important in regulating the immune response; however, 
little is known about how FasL function is controlled. One mechan- 
ism involves the molecule cFLIP, which modulates apoptosis signal- 
ling downstream of Fas 20 . A second mechanism involves proteolytic 
shedding of FasL from the cell surface 17 . DcR3 competes with Fas for 




10-2 10- 1 10° 
Inhibitor (uo, ml -1 ) 



PBS IgG Fas DcR3 




80 



I 40 



10 20 
Time (h) 



10- 1 10° 10*> 
Inhibitor (jig ml -1 ) 



Figure 3 Inhibition of FasL activity by DcR3. a, Human Jurkat T leukaemia cells 
were incubated with Flag-tagged soluble FasL (sFasL;. 5ngml _1 ) oligomerized 
with anti-Flag antibody (0.1 ngmr"') in the presence of the proposed inhibitors 
DcR3-Fc, Fas-Fc or human IgGl and assayed for apoptosis (mean ± s.e.m. of 
triplicates), b. Jurkat cells were incubated with sFasL-Flag.plus anti-Flag antibody 
as in a. in presence of 1 M-g ml" 1 DcR3-Fc (filled circles). Fas-Fc (open circles) or 
human IgGl (triangles), and apoptosis was determined at the indicated time 
points, c. Peripheral blood T cells were stimulated with PHA and interleukin-2, 
followed by control (white bars) or anti-CD3 antibody (filled bars), together with 
phosphate-buffered saline (PBS), human IgGl. Fas-Fc. or DcR3-Fc (lOpflmr 1 ). 
After 16 h, apoptosis of CD4* cells was determined (mean ± s.e.m. of results from 
five donors), d. Peripheral blood natural killer cells were incubated with 5 'Cr- 
labelled Jurkat cells in the presence of DcR3-Fc (filled circles), Fas-Fc (open 
circles) or human IgGl (triangles), and target-cell death was determined by 
release of 6l Cr (mean ± s.d. for two donors, each in triplicate). 
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Figure 4 Genomic amplification of DcR3 in tumours, a. Lung cancers, comprising 
eight adenocarcinomas (c. d. f, g, h, j, k. r), seven squamous-cell carcinomas (a, e. 
m, n, o, p. q), one non-small-cell carcinoma (b), one small-cell carcinoma (i), and 
one bronchial adenocarcinoma (I). The data are means ± s.d. of 2 experiments 
done in duplicate, b, Colon tumours, comprising 17 adenocarcinomas. Data are 
means * s.e.m. of five experiments done in duplicate, c, In situ hybridization . 
analysis of DcR3 mRNA expression in a squamous-cell carcinoma of the lung. A 
representative bright-field image (left) and the corresponding dark-field image 
(right) show DcR3 mRNA over infiltrating malignant epithelium (arrowheads). 
Adjacent non-malignant stroma (S), blood vessel (V) and necrotic tumour tissue 
(N) are also shown, d, Average amplification of DcR3 compared with amplifica- 
tion of neighbouring genomic regions (reverse and forward. Rev and Fwd), the 
DcR3-linked marker T160. and other chromosome-20 markers, in the nine colon 
tumours showing DcR3 amplification of twofold or more (b). Data are from two 
experiments done in duplicate. Asterisk indicates P < 0.01 for a Student's Hest 
comparing each marker with DcR3. 
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FasL binding; hence, it may represent a third mechanism of 
extracellular regulation of FasL activity. A decoy receptor that 
modulates the function of the cytokine interleukin-1 has been 
described 21 . In addition, two decoy receptors that belong to the 
TNFR family, DcRl and DcR2, regulate the FasL-related apoptosis- 
inducing molecule Apo2L 22 . Unlike DcRl and DcR2, which are 
membrane-associated proteins, DcR3 is directly secreted into the 
extracellular space. One other secreted TNFR- family member is 
OPG 3 , which shares greater sequence homology with DcR3 (31%) 
than do DcRl (17%) or DcR2 (19%); OPG functions as a third 
decoy for Apo2L 19 . Thus, DcR3 and OPG define a new subset of 
TNFR-family members that function as secreted decoys to mod- 
ulate ligands that induce apoptosis. Pox viruses produce soluble 
TNFR homologues that neutralize specific TNF- family ligands, 
thereby modulating the antiviral immune response 2 . Our results 
indicate that a similar mechanism, namely, production of a soluble 
decoy receptor for FasL, may contribute to immune evasion by 
certain tumours. Q 

Methods 

Isolation of DcR3 cONA. Several overlapping ESTs in GenBank (accession 
numbers AA025672, AA025673 and W67560) and in Lifeseq™ (Incyle 
Pharmaceuticals; accession numbers 1339238, 1533571, 1533650, 1542861, 
1789372 and 2207027) showed similarity to members of the TNFR family. We 
screened human cDNA libraries by PCR with primers based on the region of 
EST consensus; fetal lung was positive for a product of the expected size. By 
hybridization to a PCR-generated probe based on the ESTs, one positive clone 
(DNA30942) was identified. When searching for potential alternatively spliced 
forms of DcR3 that might encode a transmembrane protein, we isolated 50 
more clones; the coding regions of these clones were identical in size to that of 
the initial clone (data not shown). 

Fc-fusion proteins (immunoadhesins). The entire DcR3 sequence, or the 
ectodomain of Fas or TNFR1, was fused to the hinge and Fc region of human 
IgGl, expressed in insect SF9 cells or in human 293 cells, and purified as 
described 23 . 

Fluorescence-activated cell sorting (FACS) analysis. We transfected 293 
cells using calcium phosphate or Effectene (Qiagen) with pRK5 vector or pRK5 
encoding full-length human FasL 4 (2 u.g), together with pRK5 encoding CrmA 
(2jxg) to prevent cell death. After 16 h, the cells were incubated with 
biotinylated DcR3-I;c or TNFRl-Fc and then with phycoerythrin -conjugated 
streptavidin (GibcoBRL), and were assayed by FACS. The data were analysed by 
Kolmogorov-Smirnov statistical analysis. There was some detectable staining 
of vector-transfected cells by DcR3-Fc; as these cells express little FasL (data 
not shown), it is possible that DcR3 recognized some other factor that is 
expressed constitutively on 293 cells. 

Immunoprecipitation. Human 293 cells were transfected as above, and 
metabolically labelled with [ 35 S]cysteine and [ 3S S] methionine (0.5 mCi; 
Amersham). After 16 h of culture in the presence of z-YAD-frnk (IOjxM), 
the medium was immunoprecipitated with DcR3-Fc, Fas-Fc or TNFRl-Fc 
(5 fig), followed by protein A-Sepharose (RepHgen). The precipitates were 
resolved by SDS-PAGE and visualized on a phosphorimager (Fuji BAS2000). 
Alternatively, purified, Flag-tagged soluble FasL (1 u-g) (Alexis) was incubated 
with each Fc-fusion protein (1 jxg), precipitated with protein A-Sepharose, 
resolved by SDS-PAGE and visualized by immunoblotting with rabbit anti- 
FasL antibody (Oncogene Research). 

Analysis of complex formation. Flag- tagged soluble FasL (25 u,g) was 
incubated with buffer or with DcR3-Fc (40 u,g) for 1.5 h at 24 C C. The reaction 
was loaded onto a Superdex 200 HR 10/30 column (Pharmacia) and developed 
with PBS; 0.6-ml fractions were collected. The presence of DcR3-Fc-FasL 
complex in each fraction was analysed by placing 1 00 u-1 aliquots into microtitre 
wells precoated with anti-human lgG (Boehringer) to capture DcR3-Fc, 
followed by detection with biotinylated anti-Flag antibody Bio M2 (Kodak) and 
streptavidin-horseradish peroxidase (Amersham). Calibration of the column 
indicated an apparent relative molecular mass of the complex of 420K (data not 
shown), which is consistent with a stoichiometry of two DcR3-Fc homodimers 
to two soluble FasL homo trim ers. 

Equilibrium binding analysis. Microtitre wells were coated with anti-human 



IgG, blocked with 2% BSA in PBS. DcR3-Fc or Fas-Fc was added, followed by 
serially diluted Flag-tagged soluble FasL. Bound ligand was detected with anti- 
Flag antibody as above. In the competition assay, Fas-Fc was immobilized as 
above, and the wells were blocked with excess IgGl before addition of Flag- 
tagged soluble FasL plus DcR3-Fc. 

T-cell AICD. CD3 + lymphocytes were isolated from peripheral blood of 
individual donors using anti-CD3 magnetic beads (Miltenyi Biotech), 
stimulated with phytohaemagglutinin (PHA; 2 u,g ml" 1 ) for 24 h, and cultured 
in the presence of interleukin-2 (100 U ml" 1 ) for 5 days. The cells were plated in 
wells coated with anti-CD3 antibody (Pharmingen) and analysed for apoptosis 
16 h later. by FACS analysis of annexin-V-binding of CD4 + cells 24 . 
Natural killer cell activity. Natural killer cells were isolated from peripheral 
blood of individual donors using anti-CD56 magnetic beads (Miltenyi 
Biotech), and incubated for 16 h with 5l Cr-loaded Jurkat cells at an effector- 
to-target ratio of 1:1 in the presence of DcR3-Fc, Fas-Fc or human IgGl. 
Target -cell death was determined by release of *'Cr in effector- target co- 
cultures relative to release of 51 Cr by detergent lysis of equal numbers of Jurkat 
cells. 

Gene-amplification analysis. Surgical specimens were provided by J. Kern 
(lung tumours) and P. Quirke (colon tumours). Genomic DNA was extracted 
(Qiagen) and the concentration was determined using Hoechst dye 33258 
intercalation fluorometry. Amplification was determined by quantitative PCR 1 * 
using a TaqMan instrument (ABI). The method was validated by comparison of 
PCR and Southern hybridization data for the Myc and HER- 2 oncogenes (data 
not shown). Gene-specific primers and fluorogenic probes were designed on 
the basis of the sequence of DcR3 or of nearby regions identified on a BAC 
carrying the human DcR3 gene; alternatively, primers and probes were based 
on Stanford Human Genome Center marker AFM218xe7 (T160), which is 
linked to DcR3 (likelihood score = 5.4), SHGC-36268 (T159), the nearest 
available marker which maps to —500 kilobases from T160, and five extra 
markers that span chromosome 20. The DcR3-specific primer sequences were 
5'-CTTCTTCGCGCACGCTG-3' and 5'-ATCACGCCGGCACCAG-3' and the 
fluorogenic probe sequence was 5 ' - ( FAM -ACACGATGCGTGCTCCAAGCAG 
AAp-(TAMARA), where FAM is 5' -fluorescein phosphoramidite. Relative 
gene- copy numbers were derived using the formula 2* 4CTJ , where ACT is the 
difference in amplification cycles required to detect DcR3 in peripheral blood 
lymphocyte DNA compared to test DNA. 
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ABC transporters (also known as traffic ATPases) form a large 
family of proteins responsible for the translocation of a variety 
of compounds across membranes of both prokaryotes and 
eukaryotes 1 . The recently completed Escherichia coli genome 
sequence revealed that the largest family of paralogous E coli 
proteins is composed of ABC transporters 2 . Many eukaryotic 
proteins of medical significance belong to this family, such as 
the cystic fibrosis transmembrane conductance regulator (CFTR), 
the P-glycoprotein (or multidrug-resistance protein) and the 
heterodimeric transporter associated With antigen processing 
(Tapl-Tap2). Here we report the crystal structure at 1.5 A resolu- 
tion of HisP, the ATP-binding subunit of the histidine permease, 
which is an ABC transporter from Salmonella typhimurium. We 
correlate the details of this structure with the biochemical, genetic 
and biophysical properties of the wild-type and several mutant 
HisP proteins. The structure provides a basis for understanding 
properties of ABC transporters and of defective CFTR proteins. 

ABC transporters contain four structural domains: two nucleo- 
tide-binding domains (NBDs), which are highly conserved 
throughout the family, and two transmembrane domains'. In 
prokaryotes these domains are often separate subunits which are 
assembled into a membrane-bound complex; in eukaryotes the 
domains are generally fused into a single polypeptide chain. The 
periplasmic histidine permease of S. typhimurium and £. coW"*'* is a 
well-characterized ABC transporter that is a good model for this 
superfamily. It consists of a membrane-bound complex, HisQMP 2 , 
which comprises integral membrane subunits, HisQ and HisM, and 
two copies of HisP, the ATP-binding subunit. HisP, which has 
properties intermediate between those of integral and peripheral 
membrane proteins 9 , is accessible from both sides of the membrane, 
presumably by its interaction with HisQ and HisM 6 . The two HisP 
subunits form a dimer, as shown by their cooperativity in ATP 
hydrolysis 5 , the requirement for both subunits to be present for 
activity*, and the formation of a HisP dimer upon chemical cross- 
linking. Soluble HisP also forms a dimer 3 . HisP has been purified 
and characterized in an active soluble form 3 which can be recon- 
stituted into a fully active membrane-bound complex 8 . 

The overall shape of the crystal structure of the HisP monomer is 
that of an 'L* with two thick arms (arm I and arm II); the ATP- 
binding pocket is near the end of arm I (Fig. 1). A six-stranded 
sheet (p3 and (B8-pl2) spans both arms of the L, with a domain of a 
ct- plus (3-rype structure ((31, (32, P4-37, ctl and ct2) on one side 
(within arm I) and a domain of mostly ct-helices (ct3-ot9) on the 
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Figure 1 Crystal structure of HisP. a, View of the dimer along an axis 
perpendicular to its two-fold axis. The top and bottom of the dimer are suggested 
to face towards the periplasmic and cytoplasmic sides, respectively {see text). 
TKe thickness of arm II is about 25 A, comparable to that of membrane. a-Helices 
are shown in orange and p-sheets in green, b, View along the two-fold axis of the 
HisP dimer, showing the relative displacement of the monomers not apparent in 
a. The p-strands at the dimer interface are labelled, c, View of one monomer from 
the bottom. of arm I, as shown in a, towards arm II, showing the ATP-binding 
pocket, a-c, The protein and the bound ATP are in 'ribbon' and 'bail-and-stick' 
representations, respectively. Key residues discussed in the text are indicated in 
c. These figures were prepared with MOLSCRIPT 29 . N, amino terminus; C, C 
terminus. 
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Gene amplification is a common event in the progression of 
human cancers, and amplified oncogenes have been shown to 
have diagnostic, prognostic and therapeutic relevance. A 
kinetic quantitative polymerase-chain-reaction (PCR) method, 
based on fluorescent TaqMan methodology and a new instru- 
ment (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real-time, was used to quantify 
gene amplification in tumor DIM A. Reactions are character- 
ized by the point during cycling when PCR amplification is still 
in the exponential phase, rather than the amount of PCR 
product accumulated after a fixed number of cycles. None of 
the reaction components is limited during the exponential 
phase, meaning that values are highly reproducible in reac- 
tions starting with the same copy number. This greatly 
improves the precision of DNA quantification. Moreover, 
real-time PCR does not require post-PCR sample handling, 
thereby preventing potential PCR-product carry-over con- 
tamination; it possesses a wide dynamic range of quantifica- 
tion and results in much faster and higher sample throughput. 
The real-time PCR method, was used to develop and validate 
a simple and rapid assay for the detection and quantification 
of the 3 most frequently amplified genes (myc, ccndl and 
erbB2) in breast tumors. Extra copies of myc, ccndl and erbB2 
were observed in 10, 23 and 15%, respectively, of 108 breast- 
tumor DNA; the largest observed numbers of gene copies 
were 4.6, 18.6 and 15.1, respectively. These results correlated 
well with those of Southern blotting. The use of this new 
semi-automated technique will make molecular analysis of 
human cancers simpler and more reliable, and should find 
broad applications in clinical and research settings, int. J. 
Cancer 78:661-666, 1998. 
© J 998 Wiley-Liss, Inc. 

Gene amplification plays an important role in the pathogenesis 
of various solid tumors, including breast cancer, probably because 
over-expression of the amplified target genes confers a selective 
advantage. The first technique used to detect genomic amplification 
was cytogenetic analysis. Amplification of several chromosome 
regions, visualized either as extrachromosomal double minutes 
(dmins) or as integrated homogeneously staining regions (HSRs), 
are among the main visible cytogenetic abnormalities in breast 
tumors. Other techniques such as comparative genomic hybridiza- 
tion (CGH) (Kallioniemi et ai, 1 994) have also been used in broad 
searches for regions of increased DNA copy numbers in tumor 
cells, and have revealed some 20 amplified chromosome regions in 
breast tumors. Positional cloning efforts are underway to identify 
the critical gene(s) in each amplified region. To date, genes known 
to be amplified frequently in breast cancers include myc (8q24), 
ccnd\ ( ll q 1 3), and erbB2 ( 1 7q 1 2-q2 1 ) (for review, see Bieche and 
Lidereau, 1995). 

Amplification of the myc, ccndl, and erbB2 proto-oncogenes 
should have clinical relevance in breast cancer, since independent 
studies have shown that these alterations can be used to identify 
sub-populations with a worse prognosis (Bems et al, 1992; 
Schuuring et al, 1992; Siamon et aL 1987). Muss et al (1994) 
suggested that these gene alterations may also be useful for the 
prediction and assessment of the efficacy of adjuvant chemotherapy 
and hormone therapy. 

However, published results diverge both in terms of the fre- 
quency of these alterations and their clinical value. For instance, 
over 500 studies in 10 years have failed to resolve the controversy 



surrounding the link suggested by Slamon et al (1987) between 
erb&l amplification and disease progression. These discrepancies 
are partly due to the clinical, histological and ethnic heterogeneity 
of breast cancer, but technical considerations are also probably 
involved. 

Specific genes (DNA) were initially quantified in tumor cells by 
means of blotting procedures such as Southern and slot blotting. 
These batch techniques require large amounts of DNA (5-10 
ug/reaction) to yield reliable quantitative results. Furthermore, 
meticulous care is required at all stages of the procedures to 
generate blots of sufficient quality for reliable dosage analysis. 
Recently, PCR has proven to be a powerful tool for quantitative 
DNA analysis, especially with minimal starting quantities of tumor 
samples (small, early-stage tumors and formalin-fixed, paraffin- 
embedded tissues). 

Quantitative PCR can be performed by evaluating the amount of 
product either after a given number of cycles (end-point quantita- 
tive PCR) or after a varying number of cycles during the 
exponential phase (kinetic quantitative PCR). In the first case, an 
internal standard distinct from the target molecule is required to 
ascertain PCR efficiency. The method is relatively easy but implies 
generating, quantifying and storing an internal standard for each 
gene studied. Nevertheless, it is the most frequently applied 
method to date. 

One of the major advantages of the kinetic method is its rapidity 
in quantifying a new gene, since no internal standard is required (an 
external standard curve is sufficient). Moreover, the kinetic method 
has a wide dynamic range (at least 5 orders of magnitude), giving 
an accurate value for samples differing in their copy number. 
Unfortunately, the method is cumbersome and has therefore been 
rarely used. It involves aliquot sampling of each assay mix at 
regular intervals and quantifying, for each aliquot, the amplifica- 
tion product. Interest in the kinetic method has been stimulated by a 
novel approach using fluorescent TaqMan methodology and a new 
instrument (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real time (Gibson et al, 1996; Heid et 
al, 1996). The TaqMan reaction is based on the 5' nuclease assay 
first described by Holland et al (1991). The latter uses the 5' 
nuclease activity of Taq polymerase to cleave a specific fluorogenic 
oligonucleotide probe during the extension phase of PCR. The 
approach uses dual-labeled fluorogenic hybridization probes (Lee 
et al, 1993). One fluorescent dye, co-valently linked to the 5' end 
of the oligonucleotide, serves as a reporter [FAM (i.e., 6-carboxy- 
fluorescein)] and its emission spectrum is quenched by a second 
fluorescent dye, TAMRA (i.e., 6-carboxy-tetramethyl-rhodamine) 
attached to the 3' end. During the extension phase of the PCR 
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cycle, the fluorescent hybridization probe is hydrolyzed by the 
5 '-3' nucleolytic activity of DNA polymerase. Nuclease degrada- 
tion of the probe releases the quenching of FAM fluorescence 
emission, resulting in an increase in peak fluorescence emission. 
The fluorescence signal is normalized by dividing the emission 
intensity of the reporter dye (FAM) by the emission intensity of a 
reference dye (i.e., ROX, 6-carboxy-X-rhodamine) included in 
TaqMan buffer, to obtain a ratio defined as the Rn (normalized 
reporter) for a given reaction tube. The use of a sequence detector 
enables the fluorescence spectra of all 96 wells of the thermal 
cycler to be measured continuously during PCR amplification. 

The real-time PCR method offers several advantages over other 
current quantitative PCR methods (Celi et al, 1994): (i) the 
probe-based homogeneous assay provides a real-time method for 
detecting only specific amplification products, since specific hybri- 
dation of both the primers and the probe is necessary to generate a 
signal; (ii) the Q (threshold cycle) value used for quantification is 
measured when PCR amplification is still in the log phase of PCR 
product accumulation. This is the main reason why Q is a more 
reliable measure of the starting copy number than are end-point 
measurements, in which a slight difference in a limiting component 
can have a drastic effect on the amount of product; (Hi) use of C, 
values gives a wider dynamic range (at least 5 orders of magni- 
tude), reducing the need for serial dilution; (iv) The real-time PCR 
method is run in a closed-tube system and requires no post-PCR 
sample handling, thus avoiding potential contamination; (v) the 
system is highly automated, since the instrument continuously 
measures fluorescence in all 96 wells of the thermal cycler during 
PCR amplification and the corresponding software processes, and 
analyzes the fluorescence data; (vi) the assay is rapid, as results are 
available just one minute after thermal cycling is complete; (vii) the 
sample throughput of the method is high, since 96 reactions can be 
analyzed in 2 hr. 

Here, we applied this semi-automated procedure to determine 
the copy numbers of the 3 most frequently amplified genes in breast 
rumors (myc, ccndl and erbB2)< as well as 2 genes (alb and app) 
located in a chromosome region in which no genetic changes have 
been observed in breast tumors. The results for 108 breast tumors 
were compared with previous Southem-blot data for the same 
samples. 



MATERIAL AND METHODS 
Tumor and blood samples 

Samples were obtained from 1 08 primary breast tumors removed 
surgically from patients at the Centre Rene Huguenin; none of the 
patients had undergone radiotherapy or chemotherapy. Immedi- 
ately after surgery, the tumor samples were placed in liquid 
nitrogen until extraction of high-molccular-weight DNA. Patients 
were included in this study if the tumor sample used for DNA 
preparation contained more than 60% of tumor cells (histological 
analysis). A blood sample was also taken from 18 of the same 
patients. 

DNA was extracted from tumor tissue and blood leukocytes 
according to standard methods. 

Real-time PCR 

Theoretical basis. Reactions are characterized by the point 
during cycling when amplification of the PCR product is first 
detected, rather than by the amount of PCR product accumulated 
after a fixed number of cycles. The higher the starting copy number 
of the genomic DNA target, the earlier a significant increase in 
fluorescence is observed. The parameter C t (threshold cycle) is 
defined as the fractional cycle number at which the fluorescence 
generated by cleavage of the probe passes a fixed threshold above 
baseline. The target gene copy number in unknown samples is 
quantified by measuring C t and by using a standard curve to 
determine the starting copy number. The precise amount of 
genomic DNA (based on optical density) and its quality (i.e., lack 



of extensive degradation) are both difficult to assess. We therefore 
also quantified a control gene (alb) mapping to chromosome region 
4qll-ql3, in which no genetic alterations have been found in 
breast-tumor DNA by means of CGH (Kallioniemi et al, 1 994). 

Thus, the ratio of the copy number of the target gene to the copy 
number of the alb gene normalizes the amount and quality of 
genomic DNA. The ratio defining the level of amplification is 
termed "N", and is determined as follows: 

copy number of target gene (app, myc, ccndl, erbBl) 

N = ■ . 

copy number of reference gene (alb) 

Primers, probes, reference human genomic DNA and PCR 
consumables. Primers and probes were chosen with the assistance 
of the computer programs Oligo 4.0 (National Biosciences, Ply- 
mouth, MN), EuGene (Daniben Systems, Cincinnati, OH) and Primer 
Express (Perkin-Elmer Applied Biosystems, Foster City, CA). 

Primers were purchased from DN Agency (Malvern, PA) and 
probes from Perkin-Elmer Applied Biosystems. 

Nucleotide sequences for the oligonucleotide hybridization 
probes and primers are available on request. 

The TaqMan PCR Core reagent kit, MicroAmp optical tubes, 
and MicroAmp caps were from Perkin-Elmer Applied Biosystems. 

Standard-curve construction. The kinetic method requires a 
standard curve. The latter was constructed with serial dilutions of 
specific PCR products, according to Piatak et al. (1993). In 
practice, each specific PCR product was obtained by amplifying 20 
ng of a standard human genomic DNA (Boehringer, Mannheim, 
Germany) with the same primer pairs as those used later for 
real-time quantitative PCR. The 5 PCR products were purified 
using MicroSpin S-400 HR columns (Pharmacia, Uppsala, Swe- 
den) electrophorezed through an acrylamide gel and stained with 
ethidium bromide to check their quality. The PCR products were 
then quantified spectrophotometrically and pooled, and serially 
diluted 10-fold in mouse genomic DNA (Clontech, Palo Alto, CA) 
at a constant concentration of 2 ng/ul. The standard curve used for 
real-time quantitative PCR was based on serial dilutions of the pool 
of PCR products ranging from 10~ 7 (10 5 copies of each gene) to 
10" 10 (10 2 copies). This series of diluted PCR products was 
aliquoted and stored at — 80°C until use. 

The standard curve was validated by analyzing 2 known 
quantities of calibrator human genomic DNA (20 ng and 50 hg). 

PCR amplification. Amplification mixes (50 ul) contained the 
sample DNA (around 20 ng, around 6600 copies of disomic genes), 
10X TaqMan buffer (5 ul), 200 uM dATP, dCTP, dGTP, and 400 
uM dUTP, 5 mM MgCl 2 , 1.25 units of AmpliTaq Gold, 0.5 units of 
AmpErase uracil N-glycosylase (UNG), 200 nM each primer and 
100 nM probe. The thermal cycling conditions comprised 2 min at 
50°C and 10 min at 95°C. Thermal cycling consisted of 40 cycles at 
95°C for 15 s and 65°C for 1 min. Each assay included: a standard 
curve (from 1 0 5 to 1 0 2 copies) in duplicate, a no-template control, 
20 ng and 50 ng of calibrator human genomic DNA (Boehringer) in 
triplicate, and about 20 ng of unknown genomic DNA in triplicate 
(26 samples can thus be analyzed on a 96-well microplate). All 
samples with a coefficient of variation (CV) higher than 10% were 
retested. 

All reactions were performed in the ABI Prism 7700 Sequence 
Detection System (Perkin-Elmer Applied Biosystems), which 
detects the signal from the fluorogenic probe during PCR. 

Equipment for real-time detection. The 7700 system has a 
built-in thermal cycler and a laser directed via fiber optical cables 
to each of the 96 sample wells. A charge-coupled-device (CDD) 
camera collects the emission from each sample and the data are 
analyzed automatically. The software accompanying the 7700 
system calculates C t and determines the starting copy number in the 
samples. 
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Determination of gene amplification. Gene amplification was 
calculated as described above. Only samples with an N value 
higher than 2 were considered to be amplified. 

RESULTS 

To validate the method, real-time PCR was performed on 
genomic DNA extracted from 108 primary breast tumors, and 18 
normal leukocyte DNA samples from some of the same patients. 
The target genes were the myc, ccndl and erbBl proto-oncogenes, 
and the p-amyloid precursor protein gene (app), which maps to a 
chromosome region (21q21.2) in which no genetic alterations have 
been found in breast tumors (Kallioniemi et ai, 1994). The 
reference disomic gene was the albumin gene (alb, chromosome 
4qll-ql3). 



Validation of the standard curve and dynamic range 
of real-time PCR 

The standard curve was constructed from PCR products serially 
diluted in genomic mouse DNA at a constant concentration of 
2 ng/u.1. It should be noted that the 5 primer pairs chosen to analyze 
the 5 target genes do not amplify genomic mouse DNA (data not 
shown). Figure 1 shows the real-time PCR standard curve for the 
alb gene. The dynamic range was wide (at least 4 orders of 
magnitude), with samples containing as few as 10 : copies or as 
many as 10 5 copies. 

Copy-number ratio of the 2 reference genes (app and a\b) 

The app to alb copy-number ratio was determined in 1 8 normal 
leukocyte DNA samples and all 108 primary breast-tumor DNA 
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Figure 1 - Albumin (alb) gene dosage by real-time PCR. Top: Amplification plots for reactions with starting alb gene copy number ranging 
from 10 3 (A9), 10* (A7), 10* (A4) to 10 2 (A2) and a no-template control (Al). Cycle number is plotted vs. change in normalized reporter signal 
(ARn). For each reaction tube, the fluorescence signal of the reporter dye (FAM) is divided by the fluorescence signal of the passive reference dye 
(ROX), to obtain a ratio defined as the normalized reporter signal (Rn). ARn represents the normalized reporter signal (Rn) minus the baseline 
signal established in the first 15 PCR cycles. ARn increases during PCR as alb PCR product copy number increases until the reaction reaches a 
plateau. C t (threshold cycle) represents the fractional cycle number at which a significant increase in Rn above a baseline signal (horizontal black 
line) can first be detected. Two replicate plots were performed for each standard sample, but the data for only one are shown here. Bottom: 
Standard curve plotting log starting copy number vs. C, (threshold cycle). The black dots represent the data for standard samples plotted in 
duplicate and the red dots the data for unknown genomic DNA samples plotted in triplicate. The standard curve shows 4 orders of linear dynamic 
range. 
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samples. We selected these 2 genes because they are located in 2 
chromosome regions (app, 21q21.2; alb, 4qll-ql3) in which no 
obvious genetic changes (including gains or losses) have been 
observed in breast cancers (Kallioniemi et aL, 1994). The ratio for 
the 18 normal leukocyte DNA samples fell between 0.7 and 1.3 
(mean 1.02 ± 0.21), and was similar for the 108 primary breast- 
tumor DNA samples (0.6 to 1 .6, mean 1 .06 ± 0,25), confirming 
that alb and app are appropriate reference disomic genes for 
breast-rumor DNA. The low range of the ratios also confirmed that 
the nucleotide sequences chosen for the primers and probes were 
not polymorphic, as mismatches of their primers or probes with the 
subject's DNA would have resulted in differential amplification. 

myc, ccndl and erb£2 gene dose in normal leukocyte DNA 

To determine the cut-off point for gene amplification in breast- 
cancer tissue, 18 normal leukocyte DNA samples were tested for 
the gene dose (N), calculated as described in "Material and 
Methods". The N value of these samples ranged from 0.5 to 1.3 
(mean 0.84 ± 0.22) for myc, 0.7 to 1.6 (mean 1.06 ± 0.23) for 
ccndl and 0.6 to 1 .3 (mean 0.9 1 rt 0.19) for erbBl. Since N values 
for myc, ccndl and erbBl in normal leukocyte DNA consistently 
fell between 0.5 and 1.6, values of 2 or more were considered to 
represent gene amplification in tumor DNA. 

myc, ccndl and erbB2 gene dose in breast-tumor DNA 

myc, ccndl and erbBl gene copy numbers in the 108 primary 
breast tumors are reported in Table I. Extra copies of ccndl were 
more frequent (23%, 25/108) than extra copies of erbBl (15%, 
16/108) and myc (10%, 11/108), and ranged from 2 to 18.6 for 
ccndl, 1 to 15.1 for erbBl, and only 2 to 4.6 for the myc gene. 
Figure 2 and Table II represent tumors in which the ccndl gene was 
amplified 16-fold (T145), 6- fold (T133) and non-amplified (Tl 18). 
The 3 genes were never found to be co-amplified in the same tumor. 
erbBl and ccndl were co-amplified in only 3 cases, myc and ccndl 
in 2 cases and myc and erbBl in I case. This favors the hypothesis 
that gene amplifications are independent events in breast cancer. 
Interestingly, 5 tumors showed a decrease of at least 50% in the 
erbBl copy number (N < 0.5), suggesting that they bore deletions 
of the 17q23 region (the site of erbBl). No such decrease in copy 
number was observed with the other 2 proto-oncogenes. 

. Comparison of gene dose determined by real-time quantitative 
PCR and Southern-blot analysis 

Southern-blot analysis of myc, ccndl and erbBl amplifications 
had previously been done on the same 1 08 primary breast tumors. A 
perfect correlation between the results of real-time PCR and 
Southern blot was obtained for tumors with high copy numbers 
(N > 5). However, there were cases (1 myc, 6 ccndl and 4 erbBl) 
in which real-time PCR showed gene amplification whereas 
Southern-blot did not, but these were mainly cases with low extra 
copy numbers (N from 2 to 2.9). 

DISCUSSION 

The clinical applications of gene amplification assays are 
currently limited, but would certainly increase if a simple, standard- 
ized and rapid method were perfected. Gene amplification status 
has been studied mainly by means of Southern blotting, but this 
method is not sensitive enough to detect low-level gene amplifica- 
tion nor accurate enough to quantify the full range of amplification 
values. Southern blotting is also time-consuming, uses radioactive 



TABLE 1 - DISTRIBUTION OF AMPLIFICATION LEVEL (N) FOR myc. 
ccndl AND erbB2 GENES IN 108 HUMAN BREAST TUMORS 



Gene 




Amplification level (N) 




<0.5 


0.5-1.9 2-4.9 


as 


myc 

ccndl 

erbBl 


0 

0 

5 (4.6%) 


97 (89.8%) 11 (10.2%) 
83 (76.9%) 17(15.7%) 
87 (80.6%) 8 (7.4%) 


0 

8 (7.4%) 
8 (7.4%) 



reagents and requires relatively large amounts of high-quality 
genomic DNA, which means it cannot be used routinely in many 
laboratories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene from minimal 
quantities of tumor DNA (small early-stage rumors, cytopuncture 
specimens or formalin-fixed, paraffin-embedded tissues). 

In this study, we validated a PCR method developed for the 
quantification of gene over-representation in rumors. The method, 
based on real-time analysis of PCR amplification, has several 
advantages over other PCR-based quantitative assays such as 
competitive quantitative PCR (Celi et al, 1 994). First, the real-time 
PCR method is performed in a closed-tube system, avoiding the 
risk of contamination by amplified products. Re-amplification of 
carryover PCR products in subsequent experiments can also be 
prevented by using the enzyme uracil N-glycosylase (UNG) 
(Longo et ai, 1990). The second advantage is the simplicity and 
rapidity of sample analysis, since no post-PCR manipulations are 
required. Our results show that the automated method is reliable. 
We found it possible to determine, in triplicate, the number of 
copies of a target gene in more than 100 tumors per day. Third, the 
system has a linear dynamic range of at least 4 orders of magnitude, 
meaning that samples do not have to contain equal starting amounts 
of DNA. This technique should therefore be suitable for analyzing 
formalin-fixed, paraffin-embedded tissues. Fourth, and above all, 
real-time PCR makes DNA quantification much more precise and 
reproducible, since it is based on C, values rather than end-point 
measurement of the amount of accumulated PCR product. Indeed, 
the ABI Prism 7700 Sequence Detection System enables C t to be 
calculated when PCR amplification is still in the exponential phase 
and when none of the reaction components is rate-limiting. The 
within-run CV of the C, value for calibrator human DNA (5 
replicates) was always below 5%, and the between-assay precision 
in 5 different runs was always below 10% (data not shown). In 
addition, the use of a standard curve is not absolutely necessary, 
since the copy number can be determined simply by comparing the 
C K ratio of the target gene with that of reference genes. The results 
obtained by the 2 methods (with and without a standard curve) are 
similar in our experiments (data not shown). Moreover, unlike 
competitive quantitative PCR, real-time PCR does not require an 
internal control (the design and storage of internal controls and the 
validation of their amplification efficiency is laborious). 

The only potential disavantage of real-time PCR, like all other 
PCR-based methods and solid-matrix blotting techniques (South- 
ern blots and dot blots) is that is cannot avoid dilution artifacts 
inherent in the extraction of DNA from tumor cells contained in 
heterogeneous tissue specimens. Only FISH and immunohistochem- 
istry can measure alterations on a cell-by-cell basis (Pauletti et aL, 
1996; Slamon et aL, 1989). However, FISH requires expensive 
equipment and trained personnel and is also time-consuming. 
Moreover, FISH does not assess gene expression and therefore 
cannot detect cases in which the gene product is over-expressed in 
the absence of gene amplification, which will be possible in the 
future by real-time quantitative RT-PCR. Immunohistochemistry is 
subject to considerable variations in the hands of different teams, 
owing to alterations of target proteins during the procedure, the 
different primary antibodies and fixation methods used and the 
criteria used to define positive staining. 

The results of this study are in agreement with those reported in 
the literature, (i) Chromosome regions 4qll-ql3 and 2 1 q21 .2 
(which bear alb and app, respectively) showed no genetic alter- 
ations in the breast-cancer samples studied here, in keeping with 
the results of CGH (Kallioniemi et ai, 1994). (//) We found that 
amplifications of these 3 oncogenes were independent events, as 
reported by other teams (Bems et aL, 1 992; Borg et aL, 1992). (Hi) 
The frequency and degree of myc amplification in our breast tumor 
DNA series were lower than those of ccndl and erbBl amplifica- 
tion, confirming the findings of Borg et al. (1992) and Courjal etaL 
(1997). (iv) The maxima of ccndl and erbBl over-representation 
were 1 8-fold and 15-fold, also in keeping with earlier results (about 
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Figure 2 - ccndl and alb gene dosage by real-time PCR in 3 breast tumor samples: Tl 1 8 (E 1 2, C6, black squares), T133 (Git. B4, red squares) 
and T145 (A8, C8, blue squares). Given the C, of each sample, the initial copy number is inferred from the standard curve obtained during the same 
experiment. Triplicate plots were performed for each tumor sample, but the data for only one are shown here. The results are shown in Table II. 



30-fold maximum) (Berns et ai, 1992;Borge/a/„ 1992; Courjal et 
al t 1997). (v) The er/>B2 copy numbers obtained with real-time 
PCR were in good agreement with data obtained with other 
quantitative PCR-based assays in terms of the frequency and 
degree of amplification (An et ai, 1995; Deng et ai. 1996; Valeron 



et ai, 1996). Our results also correlate well with those recently 
published by Gelmini et ai ( 1 997), who used the TaqMan system to 
measure erb&2 amplification in a small series of breast tumors 
(n - 25), but with an instrument (LS-50B luminescence spectrom- 
eter, Perkin-Elmer Applied Biosystems) which only allows end- 



4 -V^"* 





666 



B1ECHE ETAL. 



TABLE II - EXAMPLES OF ccndl GENE DOSAGE RESULTS 
FROM 3 BREAST TUMORS 1 



Tumor 




ccndl 






alb 




Uccndi/alb 


Copy 
number 


Mean 


SD 


Copy 
number 


Mean 


SD 


Til 8 


4525 






4223 










4605 


4603 


77 


4365 


4325 


89 


1.06 




4678 






4387 








T133 


59821 






9787 










61659 


61100 


1111 


10092 


10137 


375 


6.03 




61821 






10533 








T145 


128563 






7321 










125892 


125392 


3448 


7762 


7672 


316 


16.34 




121722 






7933 









1 For each sample, 3 replicate experiments were performed and the mean 
and the standard deviation (SD) was determined. The level of ccndl gene 
amplification (Hccndllalb) is determined by dividing the average ccndl 
copy number value by the average alb copy number value. 



point measurement of fluorescence intensity. Here we report myc 
and ccndl gene dosage in breast cancer by means of quantitative 
PCR. (vi) We found a high degree of concordance between 
real-time quantitative PCR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(> 5-fold). The slightly higher frequency of gene amplification 
(especially ccndl and erb&2) observed by means of real -time 
quantitative PCR as compared with Southem-blot analysis may be 
explained by the higher sensitivity of the former method. However, 
we cannot rule out the possibility that some tumors with a few extra 



gene copies observed in real-time PCR had additional copies of an 
arm or a whole chromosome (trisomy, tetrasomy or polysomy) 
rather than true gene amplification. These 2 types of genetic 
alteration (polysomy and gene amplification) could be easily 
distinguished in the future by using an additional probe located on 
the same chromosome arm, but some distance from the target gene. 
It is noteworthy that high gene copy numbers have the greatest 
prognostic significance in breast carcinoma (Borg et al, 1992; 
Slamon et al, 1987). 

Finally, this technique can be applied to the detection of gene 
deletion as well as gene amplification. Indeed, we found a 
decreased copy number of erbBl (but not of the other 2 proto- 
oncogenes) in several tumors; erbBl is located in a chromosome 
region ( 1 7q2 1 ) reported to contain both deletions and amplifica- 
tions in breast cancer (Bieche and Lidereau, 1995). 

In conclusion, gene amplification in various cancers can be used 
as a marker of pre-neoplasia, also for early diagnosis of cancer, 
staging, prognostication and choice of treatment. Southern blotting 
is not sufficiently sensitive, and FISH is lengthy and complex. 
Real-time quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate method of analyzing large numbers of 
samples in a short time. It should find a place in routine clinical 
gene dosage. 
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Pawl a, Haynes Proteome analysis: Biological assay or data archive? 

Steven P. Gygi 

Daniel Flgeys j n reY j ew we examine the current state of proteome analysis. There are 

Ruedi Ae b ers °ld three main issues discussed: why it is necessary to study proteomes; how pro- 

teomes can be analyzed with current technology; and how proteome analysis 
Department of Molecular can ^ e usec j (0 enna nce biological research. We conclude that proteome anal- 

Biotechnology, University of ys j s j s an &ssen ti a j t 00 j j n t ne understanding of regulated biological systems. 

Washington, Seattle, VVA, USA Current technology, while still mostly limited to the more abundant proteins, 

enables the use of proteome analysis both to establish databases of proteins 
present, and to perform biological assays involving measurement of multiple 
variables. We believe that the utility of proteome analysis in future biological 
research will continue to be enhanced by further improvements in analytical 

technology. | 

i 

Contents resolution two-dimensional gel electrophoresis (2-DE), 

T )R £ 9 detected in the gel and identified by their amino acid 

1 Introduction aw sequence . The ease, sensitivity and speed with which gel- 

2 Rationale for proteome analysis , 1862 arated (eins can be ident ified by the use of recently 

2.1 Correlation between mRNA and protein developed mass spectrometry techniques have dramati- 
expression levels .... cally increased the interest in proteome- technology. One 

2.2 Proteins are dynamically modified and pro- of the most attractive features of such analyses is thai com- 
cessed - plex biological systems can potentially be studied in their 

2.3 Proteomes are dynamic and reflect the entirety, rather than as a multitude of individual compo- 
site of a biological system 1863 nefUs ^ mgkes ftr easier (Q uncover the ma com . 

3 Description and assessment of current pro- , e and oftcn ob re|ationships be tween mature 
teorne analysts technology . 1863 prod uct.s in cells. Urge-scale proteome characteriza- 

3.1 Technical requirements of proteome tech- Uon projecU have bee0 untIer taken for a number of.dif- 

. y '.' — [ f [ ferent organisms and cell types. Microbial proteome pro- 

3.2 2D electrophoresis - mass spectrometry: a ^ currentI in ss for example: Saccharo- 
common implementat.on of proteome anal- - myces cereyisiae [2]j Salmonella entertca [3], Spiroplasma 

« SIS * ■"•j" '/•e* , *-"*"L"r'^'iro/\'io -V melliferum [4 J, Mycobacterium tuberculosis [5], Ochrobac- 

3.3 Protein identification by LC-MS/MS, capu- , irt% „„,/,'• r/ri u„„„~nhiit,* ;»n,, r7i <s,*^hs> 
Ion , m\AQ/\AC ri3\AQ/\A? ic/;<; trum anthropi .[0}, Haemophilus trtfluenzae 17J, Synecho- 
^ u L ^f MS and CE ' MS/MS \l 6 /. cystis spp. [8], Escherichia colt [9J, Rhizobium legumino 

I'l l ^ rjj f r lite wrum 110], and Dictyostelium discoideum [II]. Proteome 

I'l'i ^ P vl¥/ Jc \trl projects underway for tissues of more complex organ- 

3.3.3 Cfct-M2>/Mh liio;> isms include those for: human bladder squamous cell 

3.4 Assessment of 2-DE-MS proteome tech- carcinomas (12), human liver [13], human plasma [13], 

? t °!° gy r ,■*■.'"•■■::•:••:*•; human kcratinocytes [12], human fibroblasts [12], mouse 

4 Utility of proteome analysis for biolo gI cai kidney r , 2]> and wt $erum [14) , n ^ manuscrjpt we cri . 

research liJbs ^^uy assess the concept of proteome analysis and the 

4.1 The proteome as a database 868 tcchnical feasibiIity of establishing complete proteome 

4.2 The proteome as a biological assay .... 868 and d[scuss [n which oteome analysis and 

5 Concluding remarks 870 biol ica! research inlWTOCt . 

6 References 1870 



1 Introduction 

A proteome has been defined as the protein complement, 
expressed by the genome of an organism, or, in multicel- 
lular organisms, as the protein complement expressed by a 
tissue or differentiated cell [1]. In the most common im- 
plementation of proteome analysis the proteins extracted 
from the cell or tissue analyzed are separated by high 
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Biotechnology, University of Washington, Box 357730, Seattle, WA, 
98195, USA (Tel: +206-685-4235; Pax: +206-685-6392; E-mail: ruedi 
@u. washington.edu) 

Abbreviations: CU), collision-induced dissociation; MS/MS, tandem 
mass spectrometry; SAGE, serial analysis of gene expression 

Keywords: Proteome / TVo-dimensional poiyacrylamide gel electro- 
phoresis / Tandem mass spectrometry 



2 Rationale for proteome analysis 

The dramatic growth in both the number of genome 
projects and the speed with which genome sequences 
are being determined has generated huge amounts of 
sequence information, for some species even complete 
genomic sequences ([15—17]). The description of the 
state of a biological system by the quantitative measure- 
ment of system components has long been a primary 
objective in molecular biology. With recent technical 
advances including the development of differential dis- 
play-PCR [18], cDNA microarray and DNA chip techno- 
logy (19, 20] and serial analysis of gene expression 
(SAGE) [21, 22], it is now feasible to establish global and 
quantitative mRNA expression maps of cells and tissues, 
in which the sequence of all the genes is known, at a 
speed and sensitivity which is not matched by current 
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protein analysis technology. Given the long-standing 
paradigm in biology that DNA synthesizes RNA which 
synthesizes protein, and the ability to rapidly establish 
global, quantitative mRNA expression maps, the ques- 
tions which arise are why technically complex proteome 
projects should be undertaken and what specific types of 
information could be expected from proteome projects 
which cannot be obtained from genomic and transcript 
. profiling projects. We see three main reasons for pro- 
teome analysis to become an essential component in the 
comprehensive analysis of biological systems, (i) Protein 
expression levels are not predictable from the mRNA 
expression levels, (ii) proteins are dynamically modified 
and processed in ways which are not necessarily 
apparent from the gene sequence, and (iii) proteomes 
are dynamic and reflect the state of a biological system. 

2.1 Correlation between mRNA and protein expression 
levels 

Interpretations of quantitative mRNA expression profiles 
frequently implicitly or explicitly assume that for specific 
genes the transcript levels are indicative of the levels of 
protein expression. As part of an ongoing study in our 
laboratory, we have determined the correlation of expres- 
sion at the mRNA and protein levels for a population of 
selected genes in the yeast Saccharomyces cerevisiae 
growing at mid-log phase (S. P. Gygi et at, submitted for 
publication). mRNA expression levels were calculated 
from published S AG E. t frequency tables (22J. Protein 
expression levels were quantified by metabolic radiola- 
beling of the yeast proteins, liquid scintillation counting 
of the protein spots separated by high resolution 2tDE 
and mass spectrometry identification of the protein(s) 
migrating to each spot. The selected 80 samples consti- 
tute a relatively homogeneous group with respect to pre- 
dicted half-life and expression level of the protein pro- 
ducts. Thus far, we have found a general trend but no 
strong correlation between protein and transcript levels 
(Fig, 1). For some genes studied equivalent mRNA trans- 
cript levels translated into protein abundances which 
varied by more than 50-fold. Similarly, equivalent steady- 
state protein expression levels were maintained by trans- 
cript levels varying by as much as 40-fold (S. P. Gygi 
et at. t submitted). These results suggests that even for a 
population of genes predicted to be relatively homoge- 
neous with respect to protein half-life and gene expres- 
sion, the protein levels cannot be accurately predicted 
from the level of the corresponding mRNA transcript. 

2.2 Proteins are dynamically modified and processed 

In the mature, biologically active form many proteins are 
post-translationally modified by glycosylation, phosphor- 
ylation, prenylation, acylation, ubiquitination or one or 
more of many other modifications [231 and many pro- 
teins are only functional If specifically associated or com- 
plexed witb other molecules, including DNA, RNA, pro- 
teins and organic and inorganic cpfactors. Frequently, 
modifications are dynamic and reversible and may alter 
the precise three-dimensional structure and the state of 
activity of a protein. Collectively, the state of modifica- 
tion of the proteins which constitute a biological system 
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Figun: /. Correlation between mRNA and protein levels in yeast cells. 
For a selected population of 80 genes, protein levels were measured 
by 3i -S-radiotabeling and mRNA levels were calculated from publi- 
shed SAGE tables. Inset: expanded view of the low abundance region. 
For more experimental details, also see Figs. 5 and 6, (S. P. Gygi et a/,, 
submitted). 

are important indicators for the state of the system. The 
type of protein modification and the sites modified at a 
specific cellular state can usually not be determined 
from the gene sequence alone. 



2,3 Proteomes are dynamic and reflect the state of a 
biological system 

A single genome can give rise to many qualitatively and 
quantitatively different proteomes. Specific stages of the 
cell cycle and states of differentiation, responses to 
growth and nutrient conditions, temperature and stress, 
and pathological conditions represent cellular states 
which are characterized by significantly 'difFerent pro- 
teomes. The proteome, in principle, also reflects events 
that are under translation al and post-translationa! con- 
trol. It is therefore expected that proteomics will be able 
to provide the most precise and detailed molecular des- . 
cription of the state of a cell or tissue, provided that the 
external conditions defining the state are carefully deter- 
mined. In answer to the question of whether the study 
of proteomes is necessary for the analysis of biomolec- 
ular systems, it is evident that the analysis of mature pro- 
tein products in cells is essential as there are numerous 
levels of control of protein synthesis; degradation, 
processing and modification, which are only apparent by 
direct protein analysis. 



3 Description and assessment of current proteome 
analysis technology 

3.1 Technical requirements of proteome technology 

In biological systems the level of expression as, well as 
the states of modification, processing and macro-molec- 
ular association of proteins are controlled and modu- 
lated depending on the state of the system. Comprehen- 
sive analysis of the identity, quantity and state of modifi- 
cation of proteins therefore requires the detection and 
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quantitation of the proteins which constitute the system, 
and analysis of differentially processed forms. There are 
a number of inherent difficulties in protein analysis 
which complicate these tasks. First, proteins cannot be 
amplified, It is possible to produce large amounts of a 
particular protein by over-expression in specific cell sys- 
tems. However, since many proteins are dynamically 
post-translationally modified, they cannot be easily am- 
plified in the form in which they finally function in the 
biological system. It is frequently difficult to purify from 
the native source sufficient amounts of a protein for 
analysis. From a technological point of view this trans- 
lates into the need for high sensitivity analytical tech- 
niques. Second, many proteins are modified and pro- 
cessed post-translationally. Therefore, in addition to the 
protein identity, the structural basis for differentially 
modified isoforms also needs to be determined. The dis- 
tribution of a constant amount of protein over several 
differentially modified, isoforms further reduces the 
amount of each species available for analysis. The com- 
plexity and dynamics of post-translational protein edit- 
ing thus significantly complicates proteome studies. 
Third, proteins vary dramatically with respect to their 
solubility in commonly used solvents. There arc few, if 
any, solvent conditions in which all proteins are soluble 
and which are also compatible with protein analysis. This 
makes the development of protein purification methods 
particularly difficult since both protein purification and 
solubility have to be achieved under the same condi- 
tions. Detergents, in particular sodium dodecyl sulfate 
(SDS), are frequently added to aqueous solvents to 
maintain protein solubility. The compatibility with SDS 
is a big advantage of SDS polyacrylaiuide gel electro- 
phoresis (SDS-PAGE) over other protein separation 
""techniques. Thus, SDS-PAGE and two-dimensional gel 
electrophoresis, which also uses SDS and other deter- 
gents, are the most general and preferred methods for 
the purification of small amounts of proteins, provided 
that activity does not necessarily need to be maintained. 
Lastly, the number of proteins in a given cell system is 
typically in the thousands. Any attempt to identify and 
categorize all of these must use methods which are as 
rapid as possible to allow completion of the project 
within a reasonable time frame. Therefore, a successful, 
general proteomics technology requires high sensitivity, 
high throughput, the ability to differentiate differentially 
modified proteins, and the ability to quantitatively dis- 
play and analyze all the proteins present in a sample. 

3.2 2-D electrophoresis — mass spectrometry: a common 
implementation of proteome analysis 

The most common currently used implementation of 
proteome analysis technology is based on the separation 
of proteins by two-dimensional (IEF/SDS-PAGE) gel 
electrophoresis and their subsequent identification and 
analysis by mass spectrometry (MS) or tandem mass 
spectrometry (MS/MS). In 2-DE, proteins are first separ- 
ated by isoelectric focusing (1EF) and then by SDS- 
PAGE, in the second, perpendicular dimension. Separ- 
ated proteins are visualized at high sensitivity by staining 
or autoradiography, producing two-dimensional arrays of 
proteins, 2-DE gels are, at present, the most commonly 
used means of global display of proteins in complex 



COPY 



Electrophortjit 1998. /P, 1862-1871 



samples. The separation of thousands of proteins has 
been achieved in a single gel [24, 25] and differentially 
modified proteins are frequently separated. Due to the 
compatibility of 2-DE with high concentrations of deter- 
gents, protein denaturants and other additives promoting 
protein solubility, the technique is widely used. 

The second step of this type of proteome analysis is the 
identification and analysis of separated proteins. Individ- 
ual proteins from polyacrylamide-gels have traditionally 
been identified using A'-terminal sequencing [26, 27), 
internal peptide sequencing [28, 29], immunoblotting or 
comigration with known proteins [30]. The recent dra- 
matic growth of large-scale genomic and expressed 
sequence tag (EST) sequence databases has resulted iryi 
fundamental change in the way proteins are identified ly 
their amino acid sequence. Rather than by the traditional 
methods described above, protein sequences are now fre- 
quently determined by correlating mass spectral or 
tandem mass spectral data of peptides derived from pro- 
teins, with the information contained in sequence data- 
bases [31-33]. 

There are a number of alternative approaches to pro- 
teome analysis currently under development. There is 
considerable interest in developing a proteome analysis 
stragegy which bypasses 2-DE altogether, because it is 
considered a relatively slow and tedious process, and 
because of perceived difficulties in extracting proteins 
from the gel matrix for analysis. However, 2-DE as a 
starting point for proteome analysis has many advan- 
tages compared to other techniques available today. The 
most significant strengths of the 2-DE- MS approach 
include the relatively uniform behavior of proteins in 
gels, the ability to quantify spots and the high resolution 
and simultaneous display of hundreds to thousands of 
proteins within a reasonable time frame. 

A schematic diagram of a typical procedure of the identi- 
fication of gel-separated proteins is shown in Fig. 2. Pro- 
tein spots detected in the gel are enzymatically or chemi- 
cally fragmented and the peptide fragments are isolated 
for analysis, as already indicated, most frequently by MS 
or MS /MS. There are numerous protocols for the gener- 
ation of peptide fragments from gel-separated proteins. 
They can be grouped into two categories, digestion in 
the gel slice [28, 34] or digestion after electrotransfer out 
of the gel onto a suitable membrane ([29, 35—37] and 
reviewed in [38]). In most instances either technique is 
applicable and yields good results. The analysis of MS or 
MS/MS data is an important step in the whole process 
because MS instruments can generate an enormous 
amount of information which cannot easily be managed 
manually. Recently, a number of groups have developed 
software system's dedicated to the use of peptide MS 
and MS/MS spectra for the identification of proteins. 
Proteins are identified by correlating the information 
contained in the MS spectra of protein digests or 
MS /MS spectra of individual peptides with data con- 
tained in DNA or protein sequence databases. 

The systems we are currently using in our laboratory are 
based on the separation Of the peptides contained in pro- 
tein digests by narrow bore or capillary liquid chromatog- 
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Figure 2. Schematic diagram of a procedure for identification of eel- 
separated proteins. Peptides can either be separated by a technique 
such as LC or CE, or infused as a mixture and sorted in the MS. Data- 
base searching can either be performed on peptide masses from an 
MS spectrum, peptide fragment massesfrom CID spectra of peptides, 
or a combination of both. 



raphy [39, 40) or capillary electrophoresis [41], the anal- 
ysis of the separated peptides by electrospray ionizar 
tion (ESI) MS/MS, and the correlation of the generated 
peptide spectra with sequence databases using the 
SEQUEST program developed at the University of Wash- 
ington [32, 33). The system automatically performs the 
following operations: a particular peptide ion character- 
ized by its mass-lo-charge ratio is selected in the MS out 
of all the peptide ions present in the system at a parti- 
cular time; the selected peptide ion is collided in a colli- 
sion cell with argon (collision-induced dissociation, 
CID) and the masses of the resulting fragment ions are 
determined in the second sector of the tandem MS; this 
experimentally determined CID spectrum is then corre- 
lated with the CID spectra predicted from all the pep- 
tides in a sequence database which have essentially the 
same mass as the peptide selected for CID; this correla- 
tion matches the isolated peptide with a sequence seg- 
ment in a database and thus identifies the protein from 
which the peptide was derived. There are a number of 
alternative programs which use peptide CID spectra for 
protein identification, but we use the SEQUEST system 
because it is currently the most highly automated pro- 
gram and has' proven to be successful, versatile and 
robust. 



required. As an approximate guideline, for samples con- 
taining tens of picomoles of peptides, LC-MS/MS is 
most appropriate; for samples containing low picomole 
amounts to high femtomolc amounts we use capillary 
LC-MS/MS; and for samples containing fcmtomoles or 
less, CE-MS/MS is the method of choice. 

3.3.1 .LC-MS/MS 

The coupling of an MS to an HPLC system using a 
0,5 mm diameter or bigger reverse phase (RP) column 
has been described in detail [42]. This system has several 
advantages if a large number of samples are to be ana- 
lyzed and all are available in sufficient quantity. The 
LC-MS and database searching program can be run in a 
fully automated mode. using an autosampler, thus maxi- 
mizing sample throughput and minimizing the need for 
operator interference. The relatively large column is 
tolerant of high levels of impurities from either gel prep- 
aration or sample matrix. Lastly, if configured with a 
flow-splitter and micro-sprayer [40], analyses can be per- 
formed on a small fraction of the sample, (less than 5%) 
while the remainder of the sample is recovered in very 
pure solvents. This latter feature is particularly useful 
when an orthogonal technique is also used to analyze 
peptide fractions, such as scintillation of an introduced 
radiolabel, and this data can be correlated with peptides 
identified by CID spectra. 

3.3.2 Capillary LC-MS 

An increase of sensitivity of approximately tenfold can be 
achieved by using a capillary LC system with a 100 pm ID 
column rather than a 0.5 ram ID column as referred to 
above. Since very low flow rates are required for such 
columns, most reports have used a precotumn flow split- 
ting system for producing solvent gradients. We have 
recently desribed the design and construction of a novel 
gradient mixing system which enables the formation 
of reproducible gradients at Yery low flow rates (low 
nL/min) without the need for flow splitting (A. Ducret 
ei at., submitted for publication). Using this capillary 
LC-MS/MS system we were able to identify gel-separat- 
ed proteins if low picomole to high femto mole amounts 
were loaded onto the gel [40]. litis system is as yet not 
automated and, like all capillary LC systems, is prone to 
blockage of the columns by microparticulates when ana- 
lyzing gel-separated proteins. 



3.3 Protein identification by LC-MS/MS, capillary 
LC-MS/MS and CE-MS/MS 

It has been demonstrated repeatedly that MS has a very 
high intrinsic sensitivity. For the routine analysis of gel- 
separated proteins at high sensitivity, the most signif- 
icant challenge is the handling of small amounts of 
sample. The crux of the problem is the extraction and 
transferal of peptide mixtures generated by the digestion 
of low nanogram amounts of protein, from gels into the 
MS/MS system without significant loss of sample or 
introduction of unwanted contaminants. We employ 
three different systems for introducing gel-purified sam- 
ples into an MS, depending on the level of sensitivity 



3.3.3 CE-MS/MS 

The highest level of sensitivity for analyzing gel-sep- 
arated proteins can be achieved by using capillary elec- 
trophoresis — mass spectrometry (CE-MS). We have de- 
scribed in the past a solid-phase extraction capillary elec- 
trophoresis (SPE-CE) system which was used with triple 
quadrupole and ion trap ESI-MS/MS systems for the 
identification of proteins at the low femtomole to sub- 
femtomole sensitivity level (43, 44]. While this system is 
highly sensitive, its operation is labor-intensive and its 
operation has not been automated, In order to devise an 
analytical system with both the sensitivity of a CE and 
the level of automation of LC, we have constructed 



1866 



BEST AVAILABLE COPY 



P. A. Hayncs ti al. 



Eltctrophoreih 1998, 19, 1862-1871 




liquid Junction 
+ 1,7 »o 1.9 kV 



MS entrance 



Reservoirs 



12 cm capillary 



Roaorvolr3 "PUMP" 
0.6 cm of device = j j 

microfabricated devices for the introduction of samples 
into RSI-MS for high-sensitivity peptide analysis. 

The basic device is a piece of glass into which channels 
of 10-30 urn in depth and 50-70 um in diameter are 
etched by using photolithography/etching techniques 
similar to the ones used in the semiconductor industry. 
(A simple device is shown in Fig. 3). The channels arc 
connected to an external high voltage power supply [451. 
Samples are manipulated on the device and ofl the 
device to the MS by applying different potentials - to the 
reservoirs. This creates a solvent flow by electroosmotic 
pumping which can be redirected by changing the posi- 
tion of the electrode. Therefore, without the need for 
valyes or gates and without any external pumping, the 
flow can be redirected by simply switching the position 
of the electrodes on the device: The direction and rate of 
the flow can be modulated by the size and the polarity 
of the electric field applied and also by the charge state 
of the surface. 

The type of data generated by the system is illustrated in 
Fig. 4, which shows the mass spectrum of a peptide sample 
representing the tryptic digest of carbonic anhydrase at 
290 fmol/uL, Each numbered peak indicates a peptide suc- 
cessfully identified as being derived from carbonic an- 



Figure 3. Schematic illustration of a 
microfabricated analytical system for CE, 
consisting of a micromachined device, 
. coated capillary electroosmotic pump, 
and microetcctrospray Interface. The 
dimensions of the channels and reservoir 
are as indicated in the text. The channels 
on the device were graphically enhanced 
to make them more visible. Reproduced 
from (45), with permission. 

hydrase. Some of the unassigned signals may be chemical 
or peptide contaminants. Die MS is programmed to auto- 
matically select each peak and subject the peptide to CID. 
The resulting CID spectra are then used to identify the 
protein by correlation with sequence databases. Therefore, 
this system allows us to concurrently apply a number of 
protein digests onto the device, to sequentially mobilize 
the samples, to automatically generate CID spectra of 
selected peptide ions and to search sequence databases 
for protein identification. These steps are performed auto- 
matically without the need for user input and proteins ean 
be identified at very low femtomole leyel sensitivity at a 
rate of approximately one protein per 15 min. 

3.4 Assessment of 2-DE-MS proteome technology 

Using a combination of. the analytical techniques de- 
scribed above we have identified the 80 protein spots 
indicated in Fig. 5. 'The protein pattern was generated by 
separating a total of 40 microgram of protein contained 
in a total cell lysate of the yeast strain YPH499 by high 
resolution 2-DE and silver staining of the separated pro- 
teins. To estimate how far this type of proteome analysis' 
can penetrate towards the identification of low abun- 
dance proteins, we have calculated the codon bias of the 
genes encoding the respective proteins. Codon bias is a 




Figure 4. MS spectrum of a tryptic digest 
of carbonic anhydrase using the microfa- 
bricated system shown in Fig. 3. 290 
fmol/uL of carbonic anhydrase tryptic 
digest was infused into a Finnigan LCQ 
ion. trap MS. Each peak was selected for 
CID, and those which were identified a$ 
containing peptides derived from car- 
bonic anhydrase are numbered. Repro- 
duced from 145], with permission. 
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Figure 5 2-DE separation of a lysate of yeast cells, with identified proteins highlighted. The first dimension of separation was an IPO from 
pit 3-10, and the second dimension was a 10%T SDS-PAGE gel. Proteins were visualized by silver Staining. Further details of experimental 
procedures are included in S. P. Gygi ct at. (submitted). 
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caiculated measure of the degree of redundancy of trip- 
let DNA oodons used to produce each amino acid in a 
particular gene sequence. It has been shown to be a 
useful indicator of the level of the protein product of a 
particular gene sequence present in a cell [46]. The gen- 
eral rule which applies is that the higher the value of the 
codon bias calculated for a gene, the more abundant the 
protein product of that gene becomes. The calculated 
codon bias values corresponding to the proteins identi- 
fied in Fig. 5 are shown in Fig. 6b. Nearly all of the pro- 
teins identified (> 95%) have codon bias values of > 0.2, 
indicating they are highly abundant in cells. In contrast, 
codon bias values calculated for the entire yeast genome 
(Fig. 6a) show that the majority of proteins present in 
the proteome have a codon bias of < 0.2 and are thus of 
low abundance. 

This finding is of considerable importance in our assess- 
ment of the current status of proteome analysis technol- 
ogy. It is clear that even using highly sensitive analytical 
techniques, we are only able lo visualize and identify the 



more abundant proteins. Since many important regula- 
tory proteins are present only at low abundance, these 
would not be amenable to analysis using such tech- 
niques. This situation would be exacerbated in the anal- 
ysis of proteomes containing many more proteins than 
the approximately 6000 gene products' present in yeast 
cells [16j. In the analysis of, for example, the proteome 
of any human cells, there are potentially 50000-100000 
gene products [47]. Inherent limitations on the amount 
of protein that can be loaded on 2-DE, and the number 
of components that can be resolved, indicate that only 
the most highly " abundant fraction of the many gene 
products could be successfully analyzed. One approach 
that has been employed to circumvent these limitations 
is the use of very narrow range immobilized pH gradient 
strips for the first-dimension separation of 2-DE [48], 
Since only those proteins which focus within the narrow 
range will enter the second dimension of separation, a 
much higher sample loading within the desired range is 
possible. This, in turn, can lead to the visualization arid 
identification of less abundant proteins. 
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Figure 6. Calculated coduu bias values for yeast proteins. (A) Distribu- 
tion of calculated values for the entire yeast proteome. (0) Distribu- 
tion of calculated values for the subset of 80 identified proteins also 
shown in Figs. I and 5. Further details oT experimental procedures arc 
included in S. P. Gyt;i ct at. (submitted). 



4 Utility of proteome analysis for biological 
research 

For the success of proteomics as a. mainstream approach 
to the analysis of biological systems it is essential to 
define how proteome analysis and biological research 
projects intersect. Without a clear plan for the implemen- 
tation of proteome-type approaches into biological re- 
search projects the full impact of the technology can not 
be realized. The literature indicates that proteome anal- 
ysis is used both as a database/data archive, and as a bio- 
logical assay or biological research tool. 

4A Toe proteome as a database 

The use' of proteomics as a database or data archive 
essentially entails an attempt to identify all the proteins 
in a cell or species and to annotate each protein with the 
known biological information that is relevant for each 
protein. The level of annotation can, of course, be exten- 
sive; The most, common implementation of this idea is 
the separation of proteins. by high resolution 2-DE, the 
identification of each detected protein spot and the 
annotation of the protein spots in a 2-DE gel database 
format. This approach is complicated by the fact that it is 
difficult to precisely define a proteome and to decide 
which proteome should be represented in the database. 
In contrast to the genome of a species, which is essen- 
tially static, the proteome is highly dynamic. Processes 
such as differentiation, cell activation and disease can all 
significantly change the proteome of a species. This is 
illustrated in Fig. 7. The figure shows two high-resolu- 
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tion 2-DE maps of proteins isolated from rat scrum. 
Fig. 7 A is from the serum of normal rats, while Fig. 7B 
is from the serum of rats in acute-phase serum after 
prior treatment .with an inflammation-causing agent [49]. 
It is obvious that the protein patterns are significantly 
different in several areas, raising the question of exactly 
which proteome is being described. 

Therefore, a comprehensive proteome database of a spe- 
cies or cell type needs to contain all of the parameters 
which describe the state and the type of the cells from 
which the proteins were extracted as well as the software 
tools to search the database with queries which reflect 
the dynamics of biological systems. A comprehensive 
proteome database should be capable of quantitatively 
describing the fate of each protein if specific systerr* 
and pathways are activated in the cell. Specifically, the 
quantity, the degree of modification, the subcellular loca- 
tion and the nature- of molecules specifically interacting 
with a protein as well as the rate of change of these 
variables should be described. Using these admittedly 
stringent criteria, there is currently no comlete proteome 
database. A number of such databases are, however, in 
the process of being constructed. The most advanced 
among them, in our opinion, are the yeast protein data- 
base YPD [50] (accessible at http://www.ypd.com) and 
the human 2D-PAGE databases of the Danish Centre 
for Human Genome Research [12J (accessible at http:// 
biobase.dk/cgi-bin/celis). While neither can be con- 
sidered complete as not all of the potential gene pro- 
ducts are identified, both contain extensive annotation 
of supplemental information for many of the spots 
which are positively identified in reference samples. 

4,2 The proteome as a biological assay 

The use. of proteome analysis as a biological assay or 
research tool represents an alternative approach to inte- 
grating biology with proteomics. To investigate the state 
of a system, samples are subjected to a specific proceess 
that allows the quantitative or qualitative measurement 
of some of the variables which describe the system. In 
typical biochemical assays one variable (e.g., enzyme 
activity) of a single component (e.g., a particular en- 
zyme) is measured. Using proteomics as an assay; mul- 
tiple variables (e.g., expression level, rate of synthesis, 
phosphorylation state, etc.) arc measured concurrently 
on many (ideally alt) of the proteins in a sample. The 
use of proteomics as an assay is a less far-reaching prop- 
osition than the construction of a comprehensive pro- 
teome database. It does, however, represent a pragmatic 
approach which can be adapted to investigate specific 
systems and pathways, as long as the interpretation of 
the results takes into account that with current technol- 
ogy not all of the variables which describe the system 
can be observed (see Section 3.4), 

A common implementation of proteome analysis as a 
biological assay is when a 2-DE protein pattern gener- . 
ated from the analysis of an experimental sample is 
compared to an array of reference patterns representing 
different states of the system under investigation. The 
state of the experimental system at the time the sample 
was generated is therefore determined by the quantita- 
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tive comparative analysts of hundreds to a few thousand 
proteins. Comparative analysis of the 2-DE patterns fur- 
thermore highlights quantitative and qualitative differ- 
ences in the protein profiles which correlate with the 
state of the system. For this type of analysis it is hot 
essential that all the proteins are identified or even visu- 
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alixed, although the results become more informative as 
more proteins are compared. It is obvious, however, that 
the possibility to identify any protein deemed character- 
istic for a particular state dramatically enhances this 
approach by opening up new avenues for experimenta- 
tion. 
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Figure 7. High resolution 2-DE map of proteins Isolated from rat scrum with- or without prior exposure to-an iuflam- 
matioo-causing agent. (A) normal rat serum, (B) acute-pbase scrum from rats which had previously been exposed to 
an inflammation-causing agent. The first dimension of separation is an IPG from pH 4—10, and the second dimen- 
sion is a 7,5-17,5%T gradient SDS-PAGE gel. Proteins were yisuaJiz&d by staining with amido black. Further details 
of experimental procedures are included in f!4, 49], 
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Proteorne analysis as a biological assay has been success- 
fully used in the field of toxicology, to characterize 
disease states or to study difFerentiai activation of cells. 
The approach is limited, of course, by the fact that only 
the visible protein spots are included in the assays and it 
is well known thai a substantia! but far from complete 
fraction of cellular proteins are detected if a total cell 
lysate is separated by 2-DE. Proteins may not be 
detected in 2-DE gels because they are not abundant 
enough to be visualized by the detection method used, 
because they do not migrate within the boundaries (size, 
p7) resolved by the gel, because they are not soluble, 
under the conditions used, or for other reasons. 

A different way to use proteorne analysis as a biological 
assay to define the state of a biological system is to take 
advantage of the wealth of information contained in 
2-DE protein patterns. 2-DE is referred to as two-dimen- 
sional because of the electrophoretic mobility and the 
isoelectric points which define the position of each pro- 
tein in a 2-DE pattern. In addition to the two dimen- 
sions used to generate the protein patterns, a number of 
additionaJ data dimensions are contained in the protein 
patterns. Some of these dimensions such as protein 
expression level, phosphorylation state, subcellular loca- 
tion, association with other proteins, rate of synthesis or 
degradation indicate the activity state of a protein or a 
biological system. Comparative analysis of 2-DE protein 
patterns representing different states is therefore ideally 
suited for the detection, identification and analysis of 
suitable markers. Once again it must be emphasized that 
in this type of experiment only a fraction of the cellular 
proteins is analyzed. Since many regulatory proteins are 
of low abundance, this limitation is a concern, particu- 
larly in cases in which regulatory pathways are being 
investigated. 

5 Concluding remarks 

In this report we have addressed three main issues 
related to proteorne analysis. First, we have discussed 
the rationale for studying proteomes. Second, we have 
assessed the technical feasibility of analyzing proteomes 
and described current proteorne technology, and third, 
we have analyzed the utility of proteorne analysis for bio- 
logical research. It is apparent that proteorne analysis is 
an essential tool in the analysis of biological systems. 
The multi-level control of protein synthesis and degrada- 
tion in cells means that only the direct analysis of 
mature protein products can reveal their correct identi- 
ties, their relevant state of modification and /or associa- 
tion and their amounts. Recently developed methods 
have enabled the identification of proteins at ever- 
increasing sensitivity levels and at a high level of auto- 
mation of the analytical processes. A number of tech- 
nical challenges, however, remain. While it is currently 
possible to identify essentially any protein spots that can 
be visualized by common staining methods, it is ap- 
parent that without prior enrichment only a relatively 
small and highly selected population of long-lived, 
highly expressed proteins is observed. There are many 
more proteins in a given cell which are not visualized by 
such methods. Frequently it is the low abundance pro- 
teins that execute key regulatory functions. 



We have outlined the two principal ways proteorne anal- 
ysis is currently being used to intersect with biological 
research projects: the proteorne as a database or data 
archive and proteorne analysis as a biological assay. Both 
approaches have in common that at present they are con- 
ceptually and technically limited. Current proteorne data- 
bases typically are limited to one ceil type and one state 
of a cell and therefore do not account for the dynamics 
of biological systems. The use of proteorne analysis as a 
biological assay can provide a wealth of information, but 
it is limited to the proteins detected and is therefore not 
truly proteome-wide. "These limitations in proteomics are 
to a large extent a reflection of the fact that proteins in 
their fully processed form cannot easily be amplified and 
are therefore difficult to isolate in amounts sufficientjbr 
analysis or experimentation. The fact that to date/no 
complete proteorne has been described further attests to 
these difficulties. With continued rapid progress in pro- 
tein analysis technology, however, we anticipate that the 
goal of complete proteorne analysis will eventually 
become attainable. 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-I and WISP-2, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISP-3, these proteins define a subfamily of the connective 
tissue growth factor family; Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (i) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracyline repressive promoter, and (it) Wnt-1 transgenic 
mice. The WISP- 1 gene was localized to human chromosome 
8q24.1-8q24,3. WISP- 1 genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to >30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. \MSP-3 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to >40-fold) in 63% of the colou tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20qJ2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to >30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysteine -rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsn) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-3/3 (GSK-3/3) resulting in an increase in 
jS-catenin levels. Stabilized /3-catenin interacts with the tran- 
scription factor TCF/Lefl, forming a complex that appears in 
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the nucleus and binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (APC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
j3-catenin levels (9). APC is phosphorylated by GSK-3j3, binds 
to 0-catenin, and facilitates its degradation. Mutations in 
either APC or 0-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)^ superfamily, and the 
homeobox genes, engrailed, goosecoid, twin (Xtwn), and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that arc relevant to the transformed cell pheno- 
type, we used a PCR -based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this ceil line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and rcfractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-1 
and WISP-2, and a third related gene, WISPS. The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
tiQVy a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH, SSH was performed by using the PCR-Select cDNA 
Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor; CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type. C module. 

Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AF 1 00777, 
AH00778, AF 100779, AF1007SO, and AF100781). 
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cDNA was synthesized from 2 fxg of poly(A) + RNA isolated 
from the.C57MG/Wnt : l cell line and driver cDNA from 2 pLg 
of poly(A) + RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP-1 were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-} 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
coding full-length mouse and human WISP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length. cDNAs encoding W1SP-3 were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 fiM of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles, 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Siiu Hybridization. 33 P-labeled sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP- 1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. AH tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-ainplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers, 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human ceil lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation fhiorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCI cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-mvc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Genc-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantilate the genes. The 
relative gene copy number was derived by using the form u hi 
2<Act> where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The- 
d-mcthod was used for calculation of the SE of the gene copy 
number or RNA expression level. The 1-KASP-specific signal was 
normalized to that of the glyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-EImer Applied Biosyslems. 

RESULTS 

Isolation of WISP-] and WISP-2 by SSH, To identify Wnt- 
1 -inducible genes, we used the technique of SSH using the 
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mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/V/nt-l cells. 

Two of the cDNAs, WISP- 1 and WISP-2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell tine, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Pig. 1 A and B). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on /3-catenin levels (13, 14). Expression of WISP-1 was 
up-regulated approximately 3-fold in the C57MG/Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-J induction. C57MG cells express- 
ing the Wru-I gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed slate but show a strong induction of Wnr-I mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnhl mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones, of human WISP-I were isolated and the 
sequence compared with mouse WISP-I. ThecDNA sequences 
of mouse.and human WISP- 1 were 1 ,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of ^40,000 (Af r 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Pig. 2A), 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of -27,000 (M T 27 K) (Fig. 2B). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N -linked glycosylation sites, and mouse WISP-2 has one at 
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Fig. 1. WISP- 1 and WISP-lnra induced by Wnt-1, but not Wnt-4, 
expression in C57MG ceils. Northern analysis of W!SP-! (A) and 
WISP-2 (B) expression in C57MO, C57MG/Wnt-1, and C57MG/ 
Wm-4 cells. Poly(A) 4 RNA (2 ,i,g) was subjected to Northern blot 
analysis and hybridized with «'i 70-bp mouse WISP- /-specific probe 
(amino acids 278-300) or a. 190-bp WISP- 2- specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were re hybridized with 
human /3- act in probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human (A) and mouse and human WISP-2 {B). The potential 

signal sequence, insulin-like growth factor-binding protein flGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues thai are con- 
served among the 38 cysteines found in WISP- J. 

Identification of WISP-3, To search for related proteins, we 
screened expressed sequence tag (EST) databases with, the 
W1SP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WiSP-3 cDNA of L371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with W1SP-1 and 32% identity with WISP-3 (Fig. M). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1, WISP-2, and WISP-3 are novel sequences; 
however, mouse WISP- 1 is the same as the recently identified 
Elml gtnt.Elml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15), 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-I (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the proioonco- 
gene nov. CTGF is a chemotactic and mitogenic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-/3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma ove rex pressed) is an immediate 
early gene associated with quiescence and found altered in" 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wni*l. AM arc- 
secreted, cysteine-ricb heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WJSP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21 ). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CX.XC) conserved in most insulin- like growth factor (IGF)- 
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Fig. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WKSP-3 ate indicated with n dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical linus). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated widi a dot. (C) Expression of 
WISP inRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetai tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas W1SP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF- BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagcns and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and S).. 
A short variable region follows the VWC domain. The third 
module, the thrombospondin ('ISP) domain is involved in 
binding .to sulfated glycoconjugates.and contains six cysteine 
residues and a conserved WS.xCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT), module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimeri/ation and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and H). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs arc secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown), 

Expression of WISP mKNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
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analysis on adult and let aJ multiple tissue cDNA panels. 
WISP-l expression was seen in the adult heart/ kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Utile or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP- 2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISPS was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISPS 
expression were detected in placenta, ovary, prostate, mid 
small intestine. 

to Sitii Localization of WISP' I ami WISPS. Expression of 
WISP-l and WISPS was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-l was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP-} expression also was observed foca I ly within tumor 
cells (data not shown). No expression was observed in normal 
breast-. Like- WISP- J, WISPS expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 E-If). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 

BJ 




I v ig.4. (A, C, £, and G) Representative hcmatoxylin/eosin-staijied 
images from breast lumors in W'nt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP- J expression are shown in B and 
D. The tumor is a moderately wcll-difiereniiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power {A and B), 
expression of WISP- 1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and D) t and tumor cells arc negative. 
Focal expression of WISP-L however, was observed in tumor cells in 
some areas. Images of WISP- 2 exptession are shown in E-H. At low 
power (E and F), expression of WISP-2 is seen in cells lying within the 
fibrovnscuflur tumor stroma. At higher magnification, these cells 
appeared to he adjacent to capillary vessels whereas tumor cells art: 
negative (G and H). 
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the predominant cell type expressing WlSP-I was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP- J is approximately 
3,48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31.] on chromosome 8q'24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates thai WISP-l is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod « 1,000) on 
chromosome 20ql2-20ql3. 1 . Human WISPS mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod - 1,000). WISPS is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification nnrJ Aberrant Expression aTWISPs in Human 
Colon Tumors, Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-l resides in the same 
general chromosomal .location (8q24) as c-myc t we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PGR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP- 1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP- 1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PGR (Fig. 6). The copy number of WISP-l and 
WISP-2 was significantly greater . than one, approximately 
2-fold for WISP- J in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISP-3 was indistinguishable from one (P = 
0.166). In addition, the copy number of WISPS was signifi- 
cantly higher than that of WISP-l (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 
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Fk;. 5. Amplification of WISP- 1 genomic DNA in colon cancer cell 
lines. {A) Am pi i Heat ion in cell line DNA was determined by quanti- 
tative PCR. (/V) Southern blots containing genomic DNA (It) /i.g) 
dignsied with F.roR) (IV/SP-P) or.Mw/I (c-myc) were hybridized with 
a 100-bp human WISP-] probe (amino acids IS6-2I 1 )) or a human 
c-rnyc probe (located ;il bp 1 90 J -2000). The WISP and myc genes are 
detected in normal hum an genomic DNA after a longer film exposure. 
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Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by qua ntita live PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three limes. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to > 25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 1 0-fold overex press ion. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP- 1, WISPS RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fro. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from I he same patient. 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means i* SEM from one experiment 
done in triplicate. The experiment was repeated at leas! twice. 



Proc. Natl. Acad. ScL USA 95 (1998) 14721 

mucosa. The amount of overexpression of WISP-3 ranged from 
4- lo >40-fold. 

DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy/ SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wm-1. 

Three of the genes isolated, WISP-1 1 WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and hov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wm-1. 
The first was C57MG ceils infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnl-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., p-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-1 -transformed cells, hours 
or days after Wnt-1 transformation. Thus, WfSP expression 
could result from Wnt-1 signaling directly through 0-catenin 
transcription factor regulation or alternatively through Wnl-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs.. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present, in CTGF, Cyr6l, nov, WISP-1, and WISP-3. 
This domain is thought lo be involved in receptor binding and 
dimerization. Growth factors, such as TGF-|3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dirners (32). It is templing to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
o\ nov. A recent report has shown that integrin otvfo serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression. of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to thai controlling connective 
tissue formation during wound repair. It has been proposed 
that mammary lurnor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-/31, which is the stimulus for 
stromal proliferation (34). TGF-/J1 is secreted by a large 
percentage of malignant breasl tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

it was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply W1SP-1 and 
WISP-2 to regulate tumor cell growLh on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP- 1 
and WfSP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP- 1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP-2 was localized to chromosome 20ql2-20qJ3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthotogue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis col i and /3-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic /3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, Lhe 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in CS7MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development.. 
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Beschreibung 

Die Erfindung betrifft menschliche Nukleinsaurescquenzen aus Blasentumorgewebe, die fur Genprodukle oder Teile 
davon kodieren, deren fonktionale Gene, die mindestens ein biologisch aktives Polypeptid kodieren und deien \ferwen- 

dU Die Erfindung betrifft weiterhin die fiber die Sequenzen erhaltlichen Polypeptide und deren Verwendung. 

Eine der Hauptkrebstodesursachen ist der Blasentumor, fur dessen Bekampfung neue Therapien notwendig sind. Bis- 
her verwendete Tberapien, wiez.B. Chemotherapie, Honnontherapie oder chirurgische Entfemung des Tumorgewebes, 
fuhren haufig nicht zu einer vollstandigen Heilung. # 

Das Phanomen Krebs geht haufig einher mit der ttber- oder Unterexpression gewisser Gene in den entarteten Zellen, 
wobei Doch unklar ist, ob diese veranderten Expressionsraten Ursache oder Folge der malignen Transformation sind. Die 
Identifikation solcher Gene ware ein wesentlicher Schritt fur die Entwicklung neuer Therapien gegen Krebs Der spon- 
tanen Entstehung von Krebs geht haufig eine Vielzahl von Mutationen voraus. Diese konnen verschiedenste Auswirkun- 
een auf das Expressionsmuster in dem betroffenen Gewebe haben, wie z. B. Unter- oder tJberexpression, aber auch &c- 
15 pression verkfirzter Gene. Meniere solcher Veranderungen durch solche Mutationskaskaden konnen schhefihch zu bos- 
artigen Entartungen fuhren. Die Komplexitat solcher Zusammenhange erschwert die experimented Herangehensweise 

^Rir die Suche nach Kandidatengenen, d. h. Genen, die im Vergleich zum Tumorgewebe im normalen Gewebe starker 
exprimiert werden, wird eine Datenbank verwendet, die aus sogenannten ESTs besteht. ESTs (Expressed Sequence Tags) 

20 sind Sequenzen von cDNAs, d. h. revers transkribierten mRNAs, den Molekulen also, die die Expression von Genen wi- 
derspieeeln. Die EST-Sequenzen werden fur normale und entartete Gewebe ermittelt. Solche Datenbanken werden von 
verschiedenen Betreibem z. T. kommerziell angeboten. Die ESTs der lifeSeq-Datenbank, die hier verwendet wird, sind 
in der Regel zwischen 150 und 350 Nukleotide lang. Sie reprasentieren ein fur ein bestirnmtes Gen unverkennbares Mu- 
ster obwohl dieses Gen nonnalerweise sehr viel langer ist (> 2000 Nukleotide). Durch Vergleich der Expressionsmuster 

25 von normalen und Tumorgewebe konnen ESTs identifiziert werden, die fur die Tbmorentstehung und -prohferation 
wichtig sind. Es besteht jedoch folgendes Problem: Da durch unterschieoTiche Konstruknonen der cDNA-Bibuotbeken 
die eefundenen EST-Sequenzen zu unterschiedUchen Regionen eines unbekannten Gens gehoren konnen, ergabe ach in 
einem solchen FaU ein vollig falsches Verhaltnis des Vorkommens dieser ESTs in dem jeweiligen Gewebe. Dieses wurde 
erst bemerkt werden, wenn das vollstandige Gen bekannt ist und somit die ESTs dem gleichen Gen zugeordnet werden 

30 k °fe wiirde nun gefunden, daB diese FehlermogHchkeit verringert werden kann, wenn zuvor samtliche ESTs aus dem je- 
weiligen Gewebstyp assembUert werden, bevor die Expressionsmuster miteinander verglichen werden. Es wurden also 
uberlappende ESTs ein und desselben Gens zu langeren Sequenzen zusammengefaBt (s. Fife. 1, Fig, 2a und ^8- 3). 
Durch diese Verlangerung und damit Abdeckung eines wesentlich groBeren Genbereichs m jeder der jewemgen Banken 

35 sollte der oben beschriebene Fehler weitgehendst vermieden werden. Da es hierzu keine bestehenden Softwareprodukte 
gab wurden Programme fur das Assemblieren von genomischen Abschnitten verwendet, die abgewandelt emgesetzt und 
durch eieene Programme erganzt wurden. Ein Flowchart der Assemblierungsprozedur ist in Fig. 2bl-2b4 dargesteUt 
Es konnten nun die Nukleinsaure-Sequenzen Seq. ID No. 1 50 gefunden werden, die als Kandidatengene beam Bla- 
sentumor eine Rolle spielen. - A 

40 VonbesondercmMeressesindmeN^^ 16 > 1», 2U, 23, Zo-Z/, 31-^:50, 

Die Erfindung betrifft somit Nukleinsaure-Sequenzen, die ein Genprodukt oder ein Teil davon kodieren, umfassend 

a) eine Nukleinsaure-Sequenz, ausgewahlt aus der Gruppe der Nukleinsaure-Sequenzen Seq. ID Nos. 2-5, 7-13, 
45 16, 18, 20, 23, 26-27, 31-32, 36, 45. 

b) eine allelische Variation der unter a) genannten Nukleinsaure-Sequenzen 
oder 

c) eine Nukleinsaure-Sequenz, die komplementar zu den unter a) oder b) genannten Nukleinsaure-Sequenzen ist 



50 



55 



60 



Die Erfindung betrifft weiterhin eine Nukleinsaure-Sequenz gemaB einer der Sequenzen Seq. ID Nos. 2-5, 7-13, 16, 
18 20 23 26-27, 31-32, 36, 45 oder eine kompiementare oder aUeUsche Variante davon Und die Nuklemsaure-Sequen- 
zen da'von, die eine 90%ige bis 95%ige Homologie zu einer humanen Nukleinsaure-Sequenz aufweisen. 

Die Erfindung betrifft auch die Nukleinsaure-Sequenzen Seq. ID No. 1 bis Seq. ID No. 50, die im Blasentumorgewebe 
erhoht exprimiert sind. . „ ■ 

Die Erfindung betrifft femer Nukleinsaure-Sequenzen, umfassend einen Ted der oben genannten Nukleinsaure-Se- 
quenzen, in solch einer ausreichenden GroBe, daB sie mit den Sequenzen Seq. ID Nos. 1-50 hybridisieren 

Die erfindungsgemaBen Nukleinsaure-Sequenzen weisen im allgemeinen eine Lange von mindestens 50 bis 4500 bp, 
vorzugsweise eine Lange von mindestens 150 bis 4000 bp, insbesondere eine Lange von 450 bis 3500 bp auf. 

Mit den erfindungsgemaBen Teilsequenzen Seq. ID Nos. 1-50 konnen gemaB gangiger Verfahrenspraxis auch Expres- 
sionskassetten konstruiert werden, wobei auf der Kassette mindestens eine der erfindungsgemaBen Nukleinsaure-Se- 
quenzen zusammen mit mindestens einer dem Faehmann allgemein bekannten KontroU- oder regulatonschen Sequenz, 
wie z B einem geeigneten Promotor, kombiniert wird. Die erfindungsgemaBen Sequenzen konnen in sense oder anti- 



sense Unentierung eingeiugi sem. 
In der Literatur sind ist eine groBe Anzahl von Expressionskassetten bzw. Vektoren und Promotoren bekannt, die ver- 

65 wendet werden konnen. ^ . _> . V11yl 

Unter Expressionskassetten bzw. Vektoren sind zu versteben: 1. bakteneUe, wie z. B., phagescnpt, pBs, <t>Al /4, pr- 
escript SK, pBs KS, pNHSa, pNH16a, P nH18a, pNH46a (Stratagene), P Trc99A, pKK223-3, PDR540, P RIT5 
™ F ^ „.,,™,™^,JL , r. n wT ^ rv^vo^t rSKldA nXTI nSG fStrataeene), pSVK3, pBPV, pMSG, 
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pSVL (Pharmacia). 

Unter Kontroll- oder regulatorischer Sequenz sind geeignete Promotoren zu verstehen. Hierbei sind zwei bevorzugte 
Vektoren der pKK232-8 und der PCM7 Vektor. Im einzelnen sind folgende Promotoren gemeint: lad, lacZ, T3, T7, gpt, 
lambda P R , tic, CMV, HSV Thymidin-Kinase, SV40, LTRs aus Retrovirus und Maus Metallothionein-L . 

Die auf der Expressionskassette befindlichen DNA-Sequenzen konnen ein Fusionsprotein kodieren, das ein bekanntes s 
Protein und ein bioiogisch akdves Polypeptid-Fragment umfaBt 

Die Expressionskassetten sind ebenfalls Gegenstand der vorliegenden Erfindung. 

Die erfindungsgemaBen Nukleinsaure-Fragmente konnen zur Herstellung von Vbllangen-Genen verwendet werden. 
Die erhaltlichen Gene sind ebenfalls Gegenstand der vorliegenden Erfindung. 

Die Erfindung betrifft auch die Verwendung der erfindungsgemaBen Nukleinsaure-Sequenzen, sowie die aus der Ver- 10 
wendung erhaltlichen Gen-Fragmente. 

Die erfindungsgemaBen Nukleinsaure-Sequenzen konnen mit geeigneten vektoren in Wirtszellen gebracht werden, in 
denen als heterologer Teil die auf den Nukleinsaure-Fragmenten enthaltene genetischen Information befindct, die expri- 
miert wird. 

Die die Nukleinsaure-Fragmente enthaltenden WirtszelleD sind ebenfalls Gegenstand der vorliegenden Erfindung. 15 

Geeignete Wirtszellen sind z. B. prokaryontische Zellsysteme wie E. coli oder eukaryontische Zellsysteme wie tieri- 
sche oder humane Zellen oder Hefen. 

Die erfindungsgemaBen Nukleinsaure-Sequenzen konnen in sense oder antisense Form verwendet werden. 

Die Herstellung der Polypeptide oder deren Fragment erfolgt durch Kulti vierung der Wirtszellen gemaB gangiger Kul- 
ti vierungsmethoden und anschlieBender Isolierung und Aufreinigung der Peptide bzw. Fragmente, ebenfalls mittels gan- 20 
giger Verfahren. Die Erfindung betrifft ferner Nukleinsaure-Sequenzen, die mindestens eine Teilsequenz eines bioio- 
gisch aktiven Polypeptids kodieren. 

Ferner betrifft die vorliegende Erfindung Polypeptid-Tfeilsequenzen, sogenannte ORF (open-reading-frame)-Peptide, 
gemaB den Sequenzprotokollen ORF ID Nos. 5 1-106. 

Die Erfindung betrifft ferner die Polypeptid-Sequenzen, die mindestens eine 80%ige Homologie, insbesondere eine 25 
90%ige Homologie zu den erfindungsgemaBen Polypeptid-Teilsequenzen der ORF. ID Nos. 51-106 aufweisen. 

Die Erfindung betrifft auch Antikorper, die gegen ein Polypeptid oder Fragment davon gerichtete sind, welche von den 
erfindungsgemaBen Nukleinsauren der Sequenzen Seq. ID No. 1 bis Seq. ID 50 kodiert werden. 

Unter Antikorper sind insbesondere monoklonale und Phage-Display- Antikorper zu verstehen. 

Die erfindungsgemaBen Polypeptide der Sequenzen ORF ID Nos. 51-106 konnen auch als Tool zum Auffinden von 30 
Wirkstoffen gegen den Blasentumor verwendet werden, was ebenfalls Gegenstand der vorliegenden Erfindung ist 

Ebenfalls Gegenstand der vorliegenden Erfindung ist die Verwendung der Nukleinsaure-Sequenzen gemaB den Se- 
quenzen Seq. ID No. 1-50 zur Expression von Polypeptiden, die als Tools zum Auffinden von Wirkstoffen gegen den 
Blasentumor verwendet werden konnen. 

Die Erfindung betrifft auch die Verwendung der gefundenen Polypeptid-Teilsequenzen ORE ID No. 51-106 als Arz- 35 
neimittel in der Gentherapie zur Behandlung gegen den Blasentumor, bzw. zur Herstellung eines Arzneimittels zur Be- 
handlung gegen den Blasentumor. 

Die Erfindung betrifft auch Arzneimittel, die mindestens eine Polypeptid-Teilsequenz ORF. ID No. 51-106 enthalten. 

Die gefundenen erfindungsgemaBen Nukleinsaure-Sequenzen konnen auch genomische oder mRNA-Sequenzen sein. 

Die Erfindung betrifft auch genomische Gene, ihre Exon- und Intronstruktur und deren SpleiBvarianten, erhaltlich aus 40 
den cDNAs der Sequenzen Seq. ID No. 1-50, sowie deren Verwendung zusammen mit geeigneten regulativen Elemen- 
ten, wie geeigneten Promotoren und/oder Enhancem. ^ 

Mit den erfindungsgemaBen Nukleinsauren (cDNA-Sequenzen) Seq. ID No. 1-50 werden genomische BAC-, PAC- 
und Cosmid-Bibliotheken gescreent und fiber komplementare Basenpaarung (Hybridisierung) spezifisch humane Klone 
isoliert Die so isolierten BAC-, PAC- und Cosmid-Klone werden mit Hilfe der Ruoreszenz-in-situ-Hybridisation auf 45 
Metaphasenchromosomen hybridisiert und entsprechende Chromosomenabschnitte identifiziert, auf denen die entspre- 
chenden gehomischen Gene liegen. BAC-, PAC- und Cosmid-Klone werden sequenziert, um die entsprechenden genom- 
ischen Gene in ihrer vollstandigen Struktur (Promotoren, Enhancer, Silencer, Exons und Introns) aufzuklaren. BAC-, 
PAC- und Cosmid-Klone konnen als eigenstandige Molekule fur den Gentransfer eingesetzt werden (s. Fig. 5). 

Die Erfindung betrifft auch BAC-, PAC- und Cosmid-Klone, enthaltend funktionelle Gene und ihre chromosomale 50 
Legalisation, entsprechend den Sequenzen Seq. ID. No. 1 bis Seq. ID No. 50, zur verwendung als Vehikel zum Gentrans- 
fer. 

Bedeutungen von Fachbegriffen und Abkiirzungen 

55 

Nukleinsauren = Unter Nukleinsauren sind in der vorliegenden Erfindung zu verstehen: mRNA, partielle cDNA, vollan- 
gen cDNA und genomische Gene (Chromosomen). 

ORF = Open Reading Frame, eine definierte Abfolge von Aminosauren, die von der cDNA-Sequenz abgeleitet werden 
kann. 

Contig = eine Menge von DNA-Sequenzen, die aufgrund sehr groBer Ahnlichkeiten zu einer Sequenz zusammengefaBt 60 
werden konnen (Consensus) 

Singleton = ein Contig, der nur eine Sequenz enthalt. 

Erklarung zu den Alignmentparametern 

65 

minimal initial match = minimaler anfanglicher Identitatsbereich 
maximum pads per read = maximale Anzahl von Insertionen 
maximum percent mismatch - maximale Abweichung in %. 
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Erklarung der Abbildungen 

Fig, 1 zeigt die systematische Gen-Suche in der Incyte IifeSeq Datenbank. 
Fig, 2a zeigt das Prinzip der EST-Assemblierung 
5 Fig. 2bl-2b4 zeigt das gesamte Prinzip der EST-Assemblierung 

Fig. 3 zeigt die in silico Subtraktion der Genexpression in verschiedenen Geweben 

Fig. 4a zeigt die Bestimmung der gewebsspezifischeo Expression iiber elektronischen Northern. 

Fig. 4b zeigt den elektronischen Northern 

Fig. 5 zeigt die Isolierung von genomischen BAC- und PAC-Klonen. 
10 Die nachfolgenden Beispiele erlautem die Herstellung der erfindungsgemaBen Nukleinsaure-Sequenzen, ohne die Er- 
findung auf diese Beispiele und Nukleinsaure-Sequenzen zu beschranken. 

Beispiel 1 

l5 Suche nach Tumor-bezogenen Kandidatengenen 

Zuerst wurden samtliche ESTs des entsprechenden Gewebes aus der lifeSeq-Datenbank (vom Oktober 1997) extra- 
hiert. Diese wurden dann mittels des Programms GAP4 des Staden-Pakets mit den Parametem 0% mismatch, 8 pads per 
read und einem minimalen match von 20 assembliert Die nicht in die GAP4-Datenbank aufgenommenen Sequenzen 

20 (Fails) wurden erst bei 1% mismatch und dann nochmals bei 2% mismatch mit der Datenbank assembliert Aus den Con- 
tigs der Datenbank, die aus mehr als einer Sequenz bestanden, wurden Consensussequenzen errechnet Die Singletons 
der Datenbank, die nur aus einer Sequenz bestanden, wurden mit den nicht in die GAP4-Datenbank aufgenommenen Se- 
quenzen bei 2% mismatch emeut assembliert Wiederum wurden fur die Contigs die Consensussequenzen ermittelt Alle 
ubrigen ESTs wurden bei 4% mismatch emeut assembliert Die Consensussequenzen wurden abermals extrahiert und 

25 mit den vorherigen Consensussequenzen sowie den Singletons und den nicht in die Datenbank aufgenommenen Sequen- 
zen abschlieBend bei 4% mismatch assembliert Die Consensussequenzen wurden gebildet und mit den Singletons und 
Fails als Ausgangsbasis fur die Gewebsvergleiche verwendet Durch diese Prozedur konnte sichergestellt werden, daB 
unter den verwendeten Parametem samtliche Sequenzen von einander unabhangige Genbereiche darstellten. 
Fig. 2bl-2b4 veranschaulicht die Verlangemng der Blasengewebe ESTs. 

30 Die so assemblierten Sequenzen der jeweiligen Gewebe wurden anschlieBend mittels des gleichen Programms mitein- 
ander verglichen (Fig. 3). Hierzu wurden erst alle Sequenzen des ersten Gewebes in die Datenbank eingegeben. (Daher 
war es wichtig, daB diese voneinander unabhangig waren.) 

Dann wurden alle Sequenzen des zweiten Gewebes mit alien des ersten verglichen. Das Ergebnis waren Sequenzen, 
die fur das erste bzw. das zweite Gewebe spezifisch waren, sowie welche, die in beiden vorkamen. Bei Letzteren wurde 

35 das Verhaltnis der Haufigkeit des Vorkommens in den jeweiligen Geweben ausgewertet Samtliche, die Auswertung der 
assemblierten Sequenzen betreffenden Programme, wurden selbst entwickelt. 

Alle Sequenzen, die mehr als viermal in jeweils einem der verglichenen Gewebe vorkamen, sowie alle, die mindestens 
funfmal so haufig in einem der beiden Gewebe vorkamen wurden weiter untersucht. Diese Sequenzen wurden einem 
elektronischen Northern (s. Beispiel 2.1) unterzogen, wodurch die Verteilung in samtlichen Tumor- und Normal-Gewe- 

40 ben untersucht wurde (s. Fig. 4a und Fig. 4b). Die relevanten Kandidaten wurden dann mit Hilfe samtlicher Incyte ESTs 
und alien ESTs offentlicher Datenbanken verlangert (s. Beispiel 3). AnschlieBend wurden die Sequenzen und ihre Uber- 
setzung in mogliche Proteine mit alien Nukleotid- und Proteindatenbanken verglichen, sowie auf mogliche, fur Proteine 
kodierende Regionen untersucht 

45 Beispiel 2 

Algorithmus zur Identifikation und Verlangerung von partiellen cDNA-Sequenzen mit verandertem Expressionsmuster 

Im folgenden soli ein Algorithmus zur Auffindung uber- oder untercxprimierter Gene erlautert werden. Die einzelnen 
50 Schritte sind der besseren tJbersicht halber auch in einem FluBdiagramm zusammengefaBt (s. Fig. 4b). 

2.1 Elektronischer Northern-Blot 

Zu einer partiellen DNA-Sequenz S, z. B. einem einzelnen EST oder einem Contig von ESTs, werden mittels eines 
55 Standardprogramms zur Homolgiesuche, z. B. BLAST (Altschul, S. R, Gish W., Miller, W., Myers, E. W. und lipman, 
D J (1990) J. MoL Biol., 215, 403^10), BLAST2 (Altschul, S. R, Madden, T. L., Schaffer, A. A., Zhang, J., Zhang, Z., 
Miller, W. und Lipman, D. J. (1997) Nucleic Acids Research 25 3389-3402) oder FASTA (Pearson, W. R. und lipman, 
D. J. (1988) Proc. Nad. Acad. Sci. USA 85 2444-2448), die homologen Sequenzen in verschiedenen nach Geweben ge- 
ordneten (privaten oder offentlichen) EST-BibHotheken besummL Die dadurch ermittelten (relativen oder absoluten) 
60 Gewebe-spezifischen Vorkommenshaufigkeiten dieser Partial-Sequenz S werden als elektronischer Northern-Blot be- 
zeichnet 

2.1.1 

65 Analog der unter 2.1 beschriebenen Verfahrensweise wurde die Sequenz Seq. ED No. 16 gefunden, die 17,7.x starker 
im normalen Blasentumorgewebe als im normalem Blasengewebe vorkommt 
Das Ergebnis ist wie folgt: 
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Elektronischer Northern fiir SEQ. ID. NO: 16 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeuf igkeit %Haeuf igkeit N/T 


T/N 


Blase 


0.0039 


0.0690 


0.0565 17.6998 


Brust 


0.0000 


0.0000 


undef 


undef 


Duenndarm 


0.0031 


0.0000 


undef 


0.0000 


Eierstock 


0.0180 


0.0078 


2.3025 0.4343 


Endokr ine s_Gewebe 


0.0000 


0.0000 


undef 


undef 


Gastrointestinal 


0.0000 


0.0000 


undef 


undef 


Gehirn 


0.0000 


0.0000 


undef 


undef 


Haeinatopoetisch 


0.0000 


0.0000 


undef 


undef 


Haut 


0.0000 


0.0000 


undef 


undef 


Hepatisch 


0.0000 


0.0000 


undef 


undef 


Herz 


0.0000 


0.0000 


undef 


undef 


Hoden 


0.0000 


0.0000 


undef 


undef 


Lunge 


0.0000 


0.0000 


undef 


undef 


Magen-Speiseroehre 


0.0000 


0.0000 


undef 


undef 


Muskel-SJceiett 


0.0000 


0.0000 


undef 


undef 


Niere 


.0.0000 


0.0000 


undef 


undef 


Pankreas 


0.0000 


0.0000 


undef 


undef 


Penis 


0.0000 


0.0000 


undef 


undef 


Prostata 


0.0065 


0.0064 


1.0236 0.9769 


Oterus_Endometrium 


0.0000 


0.0000 


undef 


undef 


Uterus__Myometrium 


0.0000 


0.0000 


undef 


undef 


Uterus_allgemein 


0.0000 


0.0JDC0 


undef 


undef 



Brust-Hyperplasie 0.0000 

Pros tata-Byperplasie 0 . 017 8 

Samenblase 0.0000 

Sinnesorgane 0.0000 

Weisse_Blutkoerperchen 0.0000 

Zervix 0.0000 



FOETUS 

%Haeufigkeit 
Entwicklung 0.0000 
Gastrointenstinal 0.0028 
Gehirn 0.0000 
Haeinatopoetisch 0.0000 

Haut 0.0000 ' 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0000 
Lunge 0.0000 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000 



Eierstock_n 
Eierstock_t 
EndokrinesjGewebe 
""Foetal 
. Gastrointestinal 
. Haeinatopoetisch 



NORMIERTE/SUBTRAHIEBTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.6000 
0.0000 
0.0000 
0.0000 
0.0035 
0.0000 
0.0000 



Haut-Muskel 0.0000 



Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Oterus n 



0000 
0000 
0000 
0068 
0000 
0000 



In analoger Verfahrensweise wurden auch folgende Northerns gefunden: 
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Elektronischer Northern fur SEQ. ID. NO: 8 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesjGewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
UterusJSndometrium 
OterusJ Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
S innes organe 
Weisse_Blutkoerperchen 
Zervix 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 



TUMOR 

%Haeufigkeit 
0.0281 
0.00S6 
0.0000 
0.0078 
0.0000 
0.0000 
0.0010 
0.0000 
0.0000 
.0323 
.0000 
.0585 
.0123 
.0000 
.0120 
0.0274 
0.0110 
0.0000 
0.0021 
0.00C0 
0.CC00 • - 
0.0000 



Verhaeltnisse 
N/T T/N 
0.0000 undef 
0.6805 1.4694 
undef 0.0000 
0.0000 undef 
undef undef 
undef 0.0000 
0.0000 undef 
.undef 0.0000 
undef 0.0000 
0 . 0000 undef 
undef undef 
0.0000 undef ' 
1.1854 0.8436 
undef undef 
0.9994 1.0006 
0.0000 undef 
0.5983 1.6714 
undef undef 
1.0236 0,9769 
undef undef: * 
UAdef undef 
undef undef 



Placenta 
Prostata 
Sinnesorgane 



Brust 
Eierstock_n 
Eierstock_t 

EndokrinesjGewebe 0.0000 

Foetal 0.0000 

. ■ Gastrointestinal 0.0000 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden O.tJOOO 

Lunge 0.0000 



NORMAL 
%Haeufigkeit 
0.0000 
0.0038 
0.0399 
0.0000 
0.0000 
0.0479 
0.0000 , 
0.0227 
0.0037 
.0000 
,0000 
.0000 
.0145 
.0000 
.0120 
.0000 
.0066 
0.0000 
0.0022 
0.0000 
0.0000 
0.0000 
0.0128 
0.0030 
0.0000 
0.0000 
0.0009 
0.0000 



FOETUS 

SHaeufigkeit 

0.0000 

0.0000 

0.0000 

0.0039 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit , 
0.0000 
0.0000 
0.0000 



Nerven 
Prostata 
" Sinnesorgane 
Uterus n 



0.0000 
0.0000 
0.0000 
0.0000 



65 
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Elektronischer Northern fur SEQ. ID. NO: 2 





NORMAL 


TUMOR . 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T 


T/N 


Bias & 


n nnnn 


U m UJU / 


0. 


0000 


undef 


Brust 


0. 0307 


ft nnc 
U* UJ / o 


. 0. 


8166 


1. 


2245 


U UC 1 * 1 1 VI CI 1, III 




ft ni 
U. UlOO 


2. 


0391 


0. 


4904 




ft m on 
U. Ul^U 


ft ftOCA 


0. 


3289 


3. 


0402 






0 0075 


3. 


3962 


0. 


2944 




0 • UX3.7 


ft ni oc 


0. 


8283 


1. 


2072 


Gehirn 


f\ A1 DC 

0.0185 


A ftOI C 


0. 


8571 


1. 


1667 




0 • 0201 


ft ft^T Q 


0. 


5293 


X. 


8892 




0.0844 


ft nnftn 
U. UUUU 


undef 


0. 


0000 


nepa t J- own 


0.0238 


ft ftftfC 

U. UUOD 


3. 


6765 


0. 


2720 


xieirz 


0.0148 


ft nftftn 
0. UUUU 


undef 


0. 


0000 


Hoden 


ft ft C 


a ft*aci 

U • U J3J. 


1. 


6399 


0. 


6098 


Lunge 


0.0145 


0.0082 


1. 


7781 


0. 


5624 




0.0387 


0.0077 


.5. 


0421 


0. 


1983 


Muskel-Skelett 


0.0308 


0.0300 


1. 


0280 


0. 


9728 


Niere 


0.0217 


0.0000 


undef 


0. 


0000 


Pankreas 


0.0099 


0.0110 


0. 


8974 


1. 


1143 


Penis 


0.0240 


0.0000 


undef 


0. 


0000 


Prostata 


0.0262 


0.0213 


1. 


2284 


0. 


8141 


Uterus_EndGmetriuni 


0.0135 


0.0000 


undef 


0. 


0000 


Uterus_Myametrium 


0.0152 


0.0408 


0. 


3741 


2. 


6732 


Oterusjallgemein 


0.0204 


0.0954 


0. 


2135 


4. 


6839 



Brust-Hyperplasie 0.0512 

Prostata-Hyperplasie 0 . 0268 

Samenblase 0.0089 

Slnnesorgane 0 . 0235 

WeisseJ31utkoerperchen 0 . 028 6 

Zervix 0.0106 



FOETUS 

%Haeufigkeit- 

Entwicklung 0.0278 

Gastrointenstinal 0.0305 

Gehirn 0.0063 

Haematopoetisch 0.0157 

Haut 0.0000 

Hepatisch 0.0000 

Herz-Blutgefaesse 0.0213 

Lunge 0.0289 

Nebenniere 0.0000 

Niere 0.0185 

Placenta 0.0121 

Prostata 0.0000 

Sinnesorgane 0.0126 



NORMIERTE/SOBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0204 
0.0000 
0.0051 



Eierstock_n 
Eierstock^t 

Endokr ines_Gewebe 0.0000 

Foetal 0.0122 

Gastrointestinal 0.0488 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0463 

Lunge 0.0164 

Nerven 0.0100 

Prostata 0.0137 

Sinnesorgane 0 . 0000 

Uterus n 0.0125 



DE 198 18 619 A 1 

Hektronischer Northern fur SEQ. ID. NO: 3 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
' Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Ute rusJMyomet rium 
Dterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
30 weisseJBlutkoerperchen 
Zervix 



10 



is 



20 



25 



NORMAL 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

o.oobo 

0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0030 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 



TUMOR 

%Haeufigkeit 

0.0256 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 



Verhaeltnisse 
N/T T/N 
0.0000 undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef 0.0000 
undef undef 
undef undef 
undef undef 
undef undef 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
tfiere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000. 



NQRMCEERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0000 
Eierstock_n 0.0000 
Eierstock^t 0.0000 
Endokrines_Gewebe 0.0000 
Foetal 0.0000 
Gastrointestinal 0.0000 
Haematopoetisch 0 . 0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0000 
Nerven 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 4 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haema t opoe t is ch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_ Endometrium 
Uterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
~ Zervix 



NORMAL 
%Haeufigkeit 
0.0000 
0.0090 
0.0123 
0.0150 
0.0068 
0038 
0044 
0187 
.0000 
0000 
0.0138 
0.0173 
0.0093 
0.0097 
0.0017 
0.0136 
0.0149 
0.0150 
0.0065 
0.0000 
0.0000 
0.0000 
0.0032 
0.0178 
0.0089 
0.0000 
0.0035 
0.0000 



TUMOR 

%Haeufigkeit 
0.0230 
0.0094 
0.0165 
0.0104 
0.0125 
0.0046 
0123 
.0000 
.0000 
.0388 
0.0000 
0.0000 
0.0143 
0.0000 
0.0180 
0.0137 
0.0055 
0.0000 
0.0064 
0.0000 
0.0136 
0.0000 



Verhaeltnisse 
N/T T/N 
0.0000 undef 
0.9527 1.0496 
0.7415 1.3487 
1.4391 0.6949 
0.5434 1.8403 
0.8283 1.2072 
0.3600 2.7779 
undef 0.0000 
undef undef 
0.0000 undef 
undef 0.0000 
undef 0.0000 
0.6532 1.5310 
.undef 0.0000 
0.0952 10.5060 
0.9913 1.0088 
2.6923 0.3714 
undef 0.0000 
1.0236 0.9769 
undef undef 
0.0000 undef 
undef undef 



10 



15 



20 



25 



30 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0167 

0.0000 

0.0354 

0.0000 

0.0000 

0.0000 

0.0072 

0.0000 

0.0062 

0.0000 

0.0000 

0.0000. 



Brust 
Eierstock_n 
Eierstoclc_t 
Endokrines_Gewebe 

Foetal 0.0151 

Gastrointestinal 0.0000 

Haematopoetisch 0.0114 

Haut-Muskel 0.0259 

Hoden 0.0077 

Lunge 0.0000 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0272 
0.0000 
0.0051 
0.0000 



Nerven 
Prostata 
Sinnesorgane 
Uterus n 



0.0060 
0.0068 
0.0155 
0.0250 



35 



40 



45 



50 



55 



60 



65 
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Hektronischer Northern fur SEQ. ID. NO: 5 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokr ine s_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_EndGmetrium 
Uterus_Myometrium 
0terus__ allgemein 
Brus t-Hyperplas ie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
30 weisse_Blutkoerperchen 
Zervix 



10 



is 



20 



25 



NORMAL 
%Haeufigkeit 
0.0000 
0.0026 
0.0031 
0.0060 
0051 
0134 
0052 
0000 
0000 
0000 
0.0032 
0.0000 
0.0031 
0.0000 
0.0034 
0.0190 
0.0017 
0.0000 
0.0131 
0.0068 
0.0000 
0.0000 
0.0000 
0.0059 
0.0000 
0.0000 
0.0000 
0.0000 



TUMOR 

%Haeufigkeit 

0.0204 

0.0056 

0.0000 

0.0156 

0.0000 

0.0093 

0.0051 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0020 

0.0000 

0.0060 

0.0068 

0.0000 

0.0000 

0.0043 

0.0000 

0.0000 

0.0000 



Verhaeltnisse 
N/T T/N 
0.0000 undef 
0.4537 2.2042 
undef 0.0000 
0.3838 2.6058 
undef 0.0000 
1.4496 0. 6898 
1.0079 0.9921 
undef undef 
undef undef 
undef undef 
undef 0.0000 
undef undef 
1.5241 0.6561 
undef undef 
0.5711 1.7510 
2.7756 0.3603 
undef 0.0000 
undef undef 
3.0709 0.3256 
undef 0.0000 
undef undef 
undef undef 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit* 

0.0000 

0.0083 

0.0063 

0.0079 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000- 



50 



55 



60 



65 



Brust 
Eierstockja 
Eierstock^t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0082 

Nerven 0.0050 

Prostata 0.0137 

Sinnesorgane 0.0000 

Uterus n 0.0000 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0000 
0.0000 
0.0051 
0.0000 
0.0000 
0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 6 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Otems_Endometrium 
Dterus_Myametrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplas ie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 
%Haeufigkeit 
0.0000 
0.0038 
0.0031 
0.0060 
0.0017 
0.0038 
0.0022 
0.0053 
0.0000 
0.0048 
0.0042 
0.0000 
0.0052 
0.0000 
0.0017 
0.0000 
0.0017 
0.0000 
0.0000 
0.0135 
0.0229 
0.0204 
0.0000 
0.0000 
0.0000 
0.0353 
0.0009 
0.0000 



TUMOR 

%Haeufigkeit 
0.0204 
.0075 
.0000 
.0052 
.0000 
.0000 
.0021 
.0000 
0.0000 
0.0000 
0.0137 
0.0000 
0.0143 
0.0000 
0.0060 
0.0000 
0.0110 
0.0000 
0.0021 
0.0000 
0.0068 
0.0000 



Verhaeltnisse 
N/T T/N 
0.0000 undef 
0.5104 1.9593 
undef 0.0000 
1.1513 0.8686 
undef 0.0000 
undef 0.0000 
1.0799 0.9260 
undef 0.0000 
undef undef 
undef 0.0000 
0.3084 3.2426 
undef undef 
0.3629 2.7557 
undef undef 
0.2856 3.5020 
undef undef 
0.1496 6.6857 
undef undef 
0.0000 undef 
undef 0.0000 
3.3668 0.2970 
undef 0.0000 



Entwicklung 
Gastrointenstinal 
Gehim 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0118 

0.0000 

0.0000 

0.0000 

0.0036 

0.0000 

0.0124 

0.0242 

0.0000 

0.0000 



Brust 



normierte/ subtrahierte bibliotheken 

%Haeuf igkeit " 
0.0000 



Eierstockja 0.0000 
Eierstock t 0.0000 
EndokrinesjGeweSe 0.0245 
Foetal 0.0151 
Gastrointestinal 0-0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0259 
Hoden 0.0000 
Lunge 0.0164 
Nerven 0.0080 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0250 



11 
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Elektronischer Northern fur SEQ. ID. NO: 7 



10 



15 



20 



25 



30 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesjGewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
' Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 

%Haeu£igkeit 

0.0000 

0.0038 

0,0061 

0.0060 

0.0017 

0.00}9 

0.0096 

0.0067 

0.0000 

0.0048 

0.0138 

0.0288 

0.0031 

0.0387 

0.0017 

0.0054 

0.0050 

0.0000 

0.0000 

0.0000 

0.0000 

0.0102 

0.0000 

0.0000 

0.0000 

0.0000 

0.0026 

0.0000 



TUMOR 

%Haeufigkeit 
0.0179 
0.0132 
0.0000 
0.0052 
0.0100 
0.0370 
0.0051 1 
0.0000 
0.0000 
0.0065 
0.0137 
0.0468 
0.0143 
.0153 
,0000 
,0000 
0055 
0000 
0.0021 
0.0000 
0.0000 
0.0000 



Verhaeltnisse 
N/T T/N 
0.0000 undef 
0.2917 3.4287 
undef 0.0000 
1.1513 0.8686 
0.1698 5.8889 
0.0518 19.3158 
1.8719 0.5342 
undef 0.0000 
undef undef 
0.7353 1.3600 
1.0023 0.9977 
0.6150 1. 
0.2177 4. 
2.5211 0. 



.6261 
.5929 
.3967 
0.0000 
0.0000 
1.1143 
undef 



undef 
undef 
0.8974 
undef 
0.0000 undef 
undef undef 
undef undef 
undef 0.0000 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0028 

0.0751 

0.0079 

0.0000 

0.0000 

0.0036 

0.0036 

0.0000 

0.0000 

0.0061 

0.0499 

0.0000. 



50 



55 



60 



65 



Brust 
Eierstock_n 0.0000 
Eierstock^t 0.0000 
EndokrinesjGewebe 0.0000 
Foetal 0.0087 
Gastrointestinal 0 . 0122 
Haematopoetisch 0.0114 
Haut-Muskel 0.0097 
Hoden 0.0540 
Lunge 0.0082 
Nerven 0.0201 
Prostata 0.0205 
Sinnesorgane 0.0000 
Uterus n 0.0375 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0068 



12 



DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 8 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesjGewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Spe i s eroehxe 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endametrium 
Uterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 

%Haeu£igkeit 

0.0000 

0.0013 

0.0123 

0.0060 

0.0068 

0.0038 

0.0044 

0.0080 

0.0037 

0.0000 

0.0021 

0.0058 

0.0031 

0.0000 

0.0051 

0.0000 

0.0066 

0.0060 

0.0087 

0.0000 

0.0000 

0.0051 

0.0000 

0.0059 

0.0000 

0.0000 

0.0035 

0.0000 



TUMOR 

%Haeufigkeit 
0.0179 
0.0038 
0.0000 
0.0078 
0100 
0093 
0144 
0000 
0000 
0.0000 
0.0000 
0.0000 
0.0082 
0.0307 
0.0060 
0.0000 
0.0000 
0.0000 
0.0064 
0.0000 
0.0068 
0.0000 



Verhaeltnisse 
N/T T/N 
0.0000 undef 
0.3403 2.9389 
undef 0.0000 
0.7675 1.3029 
0.6792 1.4722 
0.4142 2.4145 
0.3086 3.2409 
undef 0.0000 



undef 
undef 
undef 
undef 



0.0000 
undef 
0.0000 
0.0000 



0.3810 2.6245 
0.0000 undef 
0.8567 1.1673 
undef undef 
undef 0.0000 
undef 0.0000 
1.3648 0 .7327 
undef undef 
0.0000 undef 
undef 0.0000 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0056 

0.0000 

0.0000 

0.0000 

0.0000 

0. 0000 

0.0036 

0.0000 

0.0000 

0.0061 

0.0000 

0.0000: 



NORMIERTE /SUEfTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0204 
Eierstock_n 0.0000 
EierstockJ: 0.0608 
Endokrines_Gewebe 0.0000 
Foetal 0.0029 
Gastrointestinal 0.0244 
Haematopoetisch 0.0000 
Haut-Muskel 0.0097 
Hoden 0.0154 
Lunge 0.0000 
Nerven 0.0080 
Prostata 0.0000 
Sinnesorgane 0 . 0000 
Uterus n 0.0000 
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DE 198 18 619 A 1 

Hektronischer Northern fur SEQ. ID. NO: 9 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesjGewebe 
Gastrointestinal 
Gebirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
* Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus J&idometrium 
Dterus_Myometrium 
Ut erus_al Igemein 
B rust-Hype rplasie 
Prostata-Hyperplasie 
Samenblase 
S innesorgane 
30 WeisseJ31utkoerperchen 
Zervix 



10 



15 



20 



25 



NORMAL 
%Haeufigkeit 
0.0000 
0.0064 
0.0000 
0.0030 
0.0153 
0.0038 
0.0015 
0.0067 
0.0037 
0.0000 
0.0053 
0.0230 
0.0042 
0.0000 
0.0034 
0.0054 
0.0017 
0.0030 
.0065 
.0135 
.0152 
.0000 
0.0032 
0.0000 
0.0000 
0.0000 
0.0043 
0.0000 



TUMOR 

%Haeufigkeit 
0.0179 
0.0056 
0.0165 
0.0026 
0.0025 
0.0046 
.0051 
.0000 
.0847 
.0065 
.0000 
0.0234 
0.0041 
0.0077 
0.0000 
0.0068 
0.0000 
0.0267 
0.0064 
0.0000 
0.0000 
0.0000 



Verhaeltnisse 
N/T T/N 
0.0000 undef 
1.1342 0.8817 
0.0000 undef 
1513 0.8686 
1132 0 
8283 1 
2880 3 
undef 0 
0.0433 23 



1636 
2072 
4724 
0000 
0839 



0.0000 undef 
undef 0.0000 
0.9839 1.0163 
1.0161 0.9842 
0.0000 undef 
undef 0.0000 
0.7930 1.2610 
undef 0.0000 
0.1123 8.9035 
1.0236 0.9769 
undef 0.0000 
undef 0.0000 
undef undef 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
S innes organe 



FOETUS 

%Haeuf igkeit 

0.0000 

0.0000 

0.0000 

0.0039 

0.0000 

0.0000 

0.0036 

0.0108 

0.0000 

0-0062 

0.0000 

0.0499 

0.0000 



50 



55 



60 



65 



Brust 
Eierstock_ n 
Eierstock^_t 
EndokrinesjGewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0068 

0.0000 

0.0101 

0.0000 

0.0047 

0.0000 

0.0114 

0.0194 



Hoden 0.0309 
Lunge 0.0000 
Nerven 0.0040 



Prostata 
S innesorgane 
Uterus n 



0.0000 
0.0000 
0.0125 
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DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 10 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesJSewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
' Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
OterusJBndometrium 
Uterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 

%Haeufigkeit 

0,0000 

0-0051 

0.0000 

0.0000 

0.0000 

o.oopo 

0.0000 
0.0000 
0.0000 
0.0000 
0.0011 
0.0058 
0.0021 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 



TUMOR 

%Haeu£igkeit 

0.0153 

0.0019 

0.0000 

0.0000 

0.0000 

0.0093 

0.0113 

0.0000 

0.0000 

0.0129 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 



Verhaeltnisse 
N/T T/N 
0.0000 undef 
2.7221 0.3674 
undef undef 
undef undef 
undef undef 
0.0000 undef 
0.0000 undef 
undef undef 
undef undef 
0.0000 undef 
undef 0.0000 
undef 0.0000 
undef 0.0000 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 
undef undef 



FOETUS 

%Haeufigkeit- 
Entwicklung 0.0000 
Gastrointenstinal 0.0000 
Gehirn 0.0000 
Haematopoetisch 0.0000 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0036 
Lunge 0.0000 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000. 



NORMIERTE / SUBTRAHIERTE B IBL I OTHEKEN 
%Haeufigkeit 
Brust 0.0000 
Eierstock_n 0.0000 
EierstockM: 0.0000 - 
Endokrines_Gewebe 0.0000 
Foetal 0.0006 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0000 
Nerven 0.0010 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 
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DE 198 18 619 Al 

Elektronischer Northern fur SEQ. ID. NO: 11 



10 



15 



20 



25 



30 



Blase 
Brust 
Duenndann 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometritnh 
Oterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 
%Haeufigkeit 
0.0000 
0038 
0000 
0000 
0068 
0077 
0000 
0.0107 
0.0073 
0.0000 
0.0021 
0.0000 
0.0010 
0.0097 
0.0000 
0.0054 
0.0050 
0.0000 
0.0065 
0.0068 
0.0076 
0.0000 
0.0000 
0.0238 
0.0000 
0.0000 
0.0026 
0.0106 



TUMOR 

%Haeufigkeit 
0.0153 
0.0038 
0.0000 
0.0104 
0.0025 
0.0093 
0.0062 
0.0000 
0.0000 
0.0194 
0.0000 
0.0000 
.0041 
.0077 
.0000 
.0068 
.0000 
.0267 
.0298 
0.0528 
0.0068 
0.0000 



Verhaeltnisse 
N/T T/N 
0 . 0000 undef 
1.0208 0.9796 
undef undef 
0.0000 undef 
2.7170 0.3681 
0.8283 1.2072 
0.0000 undef 
undef 0.0000 
undef 0.0000 
0.0000 undef 
undef 0.0000 
undef undef 
0.2540 3.9367 
.1.2605 0.7933 
undef undef 
0.7930 1.2610 
undef 0.0000 
0.0000 undef 
0.2193 4.5590 
0.1280 7.8106 
1.1223 0.8911 
undef undef 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeif 

0.0139 

0.0194 

0.0063 

0.0000 

0.0000 

0.0260 

0.0071 

0.0108 

0.0000 

0.0000 

0.0121 

0.0499 

0.0000. 



NORMIERTE/SOEfTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0000 

Eierstock^n 0.0000 

Eierstock^t 0.0557 

Endokrines_Gewebe 0.0000 

Foetai 0.0076 

Gastrointestinal 0.0000 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0000 

Nerven 0.0030 

Prostata 0.0137 

Sinnesorgane 0 . 0387 

Uterus n 0.0042 
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^ DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 12 





NORMAL 


TUMOR 


Verhaeltnlsse 




%Haeuf igkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0000 


0.0153 


0.0000 undef 


Brust 


0.0013 


0.0000 


undef 0.0000 


Duenndann 


0.0031 


0.0000 


undef 0.0000 


Eier stock 


0.0000 


0.0026 


0 . 0000 undef 


Endokrines Gewefoe 

lull l\i V/JV*. ill 1 WW " ^ 


0.0017 


0.0075 


0.2264 4.4166 


fia«5t"ro intestinal 


0.0000 


0.0093 


0 0000 undef 


Gehirn 


0.0044 


0 0000 

V . WW 


«nde»f 0 0000 


nocHlo Uw^v/ts wxauu 


0 0000 

V . WVw 


0 0000 


U11UCJL UUUCL 


Rant* 


0 0037 








0 0000 


n oooo 


UUUClt UIIUCl 


HerZ 


ft noon 


n noon 


uQUcl UuucX. . 


Hoden 


0 0000 


0 01 17 


0 0000 undef 


Lunge 


0 0000 


0 0000 

yj . V WW 


undef undef 




0 0000 

V . WW 


0.0000 


•undef undef 




0.0000 


0 0000 


undef undef 


Niere 


0.0000 


0 0000 

V ■ WW 


undef undef 


Pankreas 


0.0017 


0 0000 


undef 0.0000 


Penis 


0.0000 


0 0000 


undef undef 


Prostata 


0.0000 


u . uuzx 


\f . W W UIIUCl 


rn*ami cs RT»Hom*»t t"? inn 


0.0000 


0.0000 


undef undef 




0 0000 


0.0000 


undef undef 


Dterus_allgemein 


0.0000 


0.0000 


undef undef 


Brust-Hyperplasie 


0.0032 






Prostata-Hyperplasie 


0.0000 






Samenblase 


0.0000 






Sinnesorgane 


0.0000 






Weisse — Blutkoerperchen 


0.0000 






Zervix 


0.0000 







Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



Brust 
Eierstockja 
Eierstock — t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haema t opoe t is ch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Oterus n 



17 



FOETUS 

%Haeufigkeit* 

0.0000 

0.0000 

0.0125 

0.0000 

0.0000 

0.0000 

0.0000 

0.0036 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000-. 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 > 

0.0245 

0.0023 

0.0000 

0.0000 

0.0000 

0.0000 

0.0082 

0.0020 

0.0000 

0.0000 

0.0000 



DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. ID. NO: 13 



10 



15 



20 



25 



30 



Blase 
Brust 
Duenndann 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoet is ch 
Haut 
Hepatisch 
Her z 
Hoden 
Lunge 

Magen-Speiseroehre 
' Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 


TUMOR 


Verhaeltnisse 


%Haeuf igkeit %Haeufigkeit N/T T/N 


a aaaa 
U.UUUU 




n nnoo unrfpf 

\J « WWW UliUC X. 


A AA on 


n ni 6Q 


0 5293 1 8893 


A AAO 1 


ft ftftftft 


undef 0 0000 


A AA CA 

U. UUbU 




1 1513 0.8686 


A AAC1 


n f\o*>G. 
u • uzzo 


0 2264 4.4166 


A AftCl 


ft ft9**1 


0 2485 4.0241 


A A A CO 

U.UUoZ 


ft nnpo 


0 6300 1 5874 


0,0107 


A ftftftft 

u. uuuu 


nnr?of A ftftftft 
UllQcl u • uuuu 


0.0037 


A ftftftft 

u. uuuu 


iinHAf ft ftftftft 


0.0000 


A A AAA 

0. UUUU 


uhqcx uuaci 


0.0138 


A A AAA 
0 • UUUU 


nn/4af ft ftftftft 


0.0345 


0.0117 


O Q(\1 ft ft "^ftfl 


0.0021 


ft fti o'k 
U.UJLZJ 


ft 1693 5 9051 


0.0000 


0.0000 


undef undef 


0.0051 


0.0060 


0.8567 1.1673 


0.0163 


0.0068 


2.3791 0.4203 


0.0000 


0.0055 


0.0000 undef 


0.0030 


0.0267 


0.1123 8.9035 


0.0174 


0.0128 


1.3648 0.7327 


0.0066 


0.0000 


undef 0.0000 


0.0000 


0.0000 


undef undef 


0.0051 


0.0000 


undef 0.0000 



0.0000 
0.0149 
0.0267 
0.0118 
0.0087 
0.0000 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeuf igkeit ■ 

0.0139 

0.0139 

0.0063 

0.0079 

0.0000 

0.0000 

0.0107 

0.0072 

0.0000 

0.0000 

0.0061 

0.0249 

0.0000. 



50 



55 



60 



65 



Brust 
Eierstock_n 
Eierstoc)c_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0101 : ; 

0.0000 

0.0146 

0.0000 

0.0057 

0.0000 

0.0309 

0.0000 

0.0010 

0.0137 

0.0000 

0.0083 
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DE 198 18 619 A 1 

Elektroniscfaer Northern fur SEQ. ID. NO: 14 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen- Speiser oehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometriura 
Uterus_Myometrium 
Uterus__allgemein 
Brust-Hyperplas ie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse__Blutkoerperchen 
Zervix 



NORMAL 
%Haeufigkeit 
0.0000 
0.0051 
0000 
0060 
0000 
0115 
0.0000 
0.0027 
0.0037 
0.0381 
0.0021 
0.0000 
0.0073 
0.0000 
0.0137 
0163 
0017 
0000 
0022 
0.0068 
0.0305 
0.0000 
0.0000 
0.0000 
0.0000 
0.0118 
0.0000 
0.0319 



TUMOR 

%Haeufigkeit 

0.0153 

0.0395 

0.0000 

0.0234 

0.0000 

0.0000 

0.0041 

0.0000 

0.0000 

0.0129 

0.0000 

0.0234 

0.0245 

0.0537 

0.0420 

0.0000 

0.0331 

0.0267 

0.0128 

0.0000 

0.0136 

0.0000 



Verhaeltnisse 
N/T T/N 
0 . 0000 undef 
0.1296 7.7146 
undef undef . 
0.2558 3.9088 
undef undef 
undef 0.0000 
0 . 0000 undef 
undef 0.0000 
undef 0.0000 
2.9412 0.3400 
undef 0.0000 
0.0000 undef 
0.2964 3.3743 
0 . 0000 undef 
0.3263 3.0643 
undef 0.0000 
0.0499 20.0570 
0 - 0000 undef 
0.1706 5.8615 
undef 0.0000 
2.2445 0.4455 
undef undef 



? FOETUS 

%Haeufigkeit 
Entwicklung 0.0278 
Gastrointenstinal 0.0583 
Gehim 0.0000 
Haematopoetisch 0.0354 

Haut 0.5025 > 
Hepatisch 0.0000 
Herz-Blutgefaesse 0-0071 
Lunge 0.0542 
Nebenniere 0.0761 
Niere 0.1235 
Placenta 0.0727 
Prostata 0.0249 
Sinnesorgane 0 . 1255. 

NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0204 
Eierstock_n 0.0000 
EierstocX_t 0.0709 
Endokrines_Gewebe 0.0245 
Foetal 0.0175 
Gastrointestinal 0.0122 
Haematopoetisch 0.0000 
Haut-Muskel 0.0162 
Hoden 0.0000 
Lunge 0.0000 
Nerven 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 15 



10 



15 



20 



25 



30 



Blase 
Brust 
Duenndann 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoet is ch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Spe is eroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
UterusJBndametrium 
Uterus_Myometrium 
Uterusjallgemein 
Brust-Byperplasie 
Pros tata-Hyperplas ie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 
%Haeufigkeit 
0.0039 
0.0179 
0.0031 
0.0090 
0.0358 
0.0019 
0.0007 
0000 
.0037 
0000 
,0244 
,0000 
0.0374 
0.0290 
0.0000 
0.0027 
0.0017 
0.0000 
,0065 
,0000 
,0152 
0000 
,0192 
0.0030 
0.0267 
0.0000 
0.0000 
0.0106 



TUMOR 

%Haeufigkeit 

0.1278 

0.0414 

0.1323 

0.0234 

0.0301 

0.0000 

0.0010 

0.0000 

0.0000 

0.0065 

0.0000 

0.0000 

0.0491 

0.0077 

0.0000 

0.0068 

0.2209 

0.0000 

0.0085 

0.0000 

0.0000 

0.0000 



Verhaeltnisse 
K/T T/N 
0.0305 32.7774 
0.4331 2.3091 
0.0232 43.1571 
0.3838 2.6058 
1.1887 0-8413 
undef 0.0000 
0.7200 1.3890 
undef undef 
undef 0.0000 
0.0000 undef 
undef 0.0000 
undef undef 
0.7621 1.3122 
3.7816 0.2644 
undef undef 
0.3965 2.5219 
0.0075 133.7133 
undef undef 
0.7677 1.3026 
undef undef 
undef 0.0000 
undef undef 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 
0.0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0667 
0000 
0000 



50 



55 



60 



65 



Brust 
Eierstocl^ja 
Eierstock_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.1224 

0.0000 

0.1013 

0.0000 
0.0268 
0.0122 
0.0057 
0.0065 
0.0000 
0.0246 
0.0000 
0.0205 
0.0000 
0.0125 
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Elektronisctaer Northern fur SEQ. ID. NO: 17 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesjGewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus^ Endometrium 
Uterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplas ie 
Pros tat a-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 

%Haeufigkeit 

0.0039 

0.0077 

0.0368 

0.0150 

0.0102 

0.0421 

0.0118 

0.0174 

0.0110 

0.0048 

0.0127 

0.0115 

0.0114 

0.0000 

0.0154 

0.0054 

0.0050 

0.0090 

0.0174 

0.0068 

0.0152 

0.0204 

0.0064 

0.0238 

0.0000 

0.0000 

0.0251 

0.0106 



TUMOR 

%Haeufigkeit 
0.0537 
0.0207 
0.0000 
0.0078 
0100 
0093 
0195 
0379 
0000 
0518 
0275 
0.0117 
0.0061 
0.0460 
0.0060 
0.0068 
0.0331 
0.0533 
0.0192 
0.0000 
0.0000 
0.0000 



Verhaeltnisse 
N/T T/N 
0.0726 13.7665 
0.3712 2.6940 
undef 0.0000 
1.9188 0.5212 
1.0189 0.9815 
4.5559 0.2195 
0.6063.1.6494 
0.4587 2.1798 
undef 0.0000 
0.0919 10.8799 
0.4626 2.1618 
0.9839 1.0163 
1.8628 0.5368 
O.0000 undef 
2.5700 0.3891 
0.7930 1.2610 
0.1496 6.6857 
0.1685 5.9357 
0.9099 1.0990 
undef 0.0000 
undef 0.0000 
undef 0.0000 



FOETUS 
%Haeufigkeit 
Entwicklung 0.0000 
Gastrointenstinal 0 . 0167 
Gehirn 0.0438 
Haematopoetisch 0.0118 

Haut 0.0000 ' 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0107 
Lunge 0.0181 
Nebenniere 0.0000 
* Niere 0.0247 

Placenta 0.0061 
Prostata 0.0249 
Sinnesorgane 0.0000. 

NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0408 

Eierstockja 0.0000 

Eierstock^t 0.0101 > 

Endokrines_Gewebe 0.0000 

Foetal 0.0087 

Gastrointestinal 0.0122 

Haematopoetisch 0.0000 

Haut-Muskel 0.0130 

Hoden 0.0000 

Lunge 0.0082 

Nerven 0.0010 

Prostata 0.0068 

Sinnesorgane 0.0000 

Uterus n 0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 18 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeuf igkeit %Haeufigkeit N/T 


T/N 


Blase 


0.0195 


0.2556 


0.0763 


13 


.1109 


Bnist 


U* UXDD 


n ft ^7 




2. 


1477 


Duenndann 


u • UUDJ. 


n ft££9 


ft 0927 


io 


.7893 


LlXCl O kwwA. 


n n^ftQ 

U. UOOj 


U • UUJ^ 


7.4832 


0. 


1336 


ITrtHolrr*! nPQ (*aw^Vi^ 


0. 0392 


0.0326 


1.2017 


0. 


8321 


CZz\ efroi ntestinal 




0 0000 

• Mm WW 


undef 


0. 


0000 


Gehirn 


n nnn7 
u. uuu / 


\J • uouo 


0.0122 


81 


.9491 


Ha^matoDoetisch 


n 0107 


0 0000 

\J . WW 


undef 


0. 


0000 


Haut 


0.0220 


0.5085 


0.0433 


23 


.0839 


Hepatisch. 


0.0238 


0.0518 


0.4596 


2. 


1760 


Herz 


0 0085 


0.0000 


undef 


0. 


0000 


Hoden 


0.0115 


0.0000 


undef 


0. 


0000 


Lunge 


0.0104 


0.0041 


2.5402 


0. 


3937 


Magen-Speiseroehre 


0.0000 


0.0077 


0.0000 


undef 


Muskel-Skelett 


0.0600 


0.0480 


1.2493 


0. 


8005 


Niere 


0.0407 


0.0068 


5.9478 


0. 


1681 


Pankreas 


0.0198 


0.0331 


0.5983 


1. 


6714 


Penis 


0.0030 


0.1066 


0.0281 


35.6140 


Prostata 


0.0000 


0.0021 


0.0000 


undef 


Uterus Endometrium 


0.0405 


0.0000 


undef 


0. 


0000 


Uterus_Myometrium 


0.0305 


0.1155 


0.2641 


3. 


7870 


Uterus allgemein 


0.0153 


0.0000 


undef 


0. 


0000 



Brust-Hyperplasie 0.0064 

Prostata-Hyperplasie 0.0030 

Samenblase 0.0000 

Sinnesorgane 0.0118 



30 Weisse_Blutkoerperchen 0.0000 
Zervix 0.0000 



FOETUS 

35 %Haeufigkeit 
Entwicklung 0.0696 
Gastrointenstinal 0.3332 
Gehirn 0.0000 
Haematopoetisch 0.2202 
40 Haut 0.0000 * 

Hepatisch 1.6381 
Herz-Blutgefaesse 0.0285 
Lunge 0.1337 
Nebenniere 1.0903 
45 " Niere 0.6301 

Placenta 0.6786 
Prostata 0.0499 
Sinnesorgane 0.0000: 

50 NORMIERTE/SUtfTRAHIERTE BIBLIOTHEKEN 

fcHaeufigkeit 

Brust 0.0544 

Eierstockji 0.0000 

55 EierstockJ: 0.1063 

EndokrinesjGewebe 0.0000 

Foetal 0.4264 

Gastrointestinal 0.0000 

Haematopoetisch 0.0000 

sl Haut-Muskel 0.0000 

60 Hoden 0.0000 

Lunge 0.0000 

Nerven 0.0030 

Prostata 0.0000 

Sinnesorgane 0 . 0000 

65 Uterus n 0.0250 
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Elektronischer Northern fur SEQ. ID. NO: 19 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesjGewebe 
Gastrointestinal 
Gehirn 
Haematopoet is ch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Utems^_Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 

%Haeufigkeit 

0.0195 

0,0192 

0.0061 

0.0180 

0.0034 

0.0000 

0.0214 

0.0134 

0.0073 

0.0048 

0.0085 

0.0000 

0.0197 

0.0000 

0.0034* 

0.0109 

0.0083 

0.0000 

0.0000 

0.0203 

0.0000 

0.0153 

0.0128 

0.0030 

0.0000 

0.0235 

0.0000 

0.0213 



TUMOR 

%Haeufigkeit 

0.2301 

0.0113 

0.0331 

0.0000 

0.1555 

0.0370 

0.1561 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0020 

0.0077 

0.1320 

0.0068 

0.0000 

0.0000 

0.0021 

0.0000 

0.0475 

0.0000 



Verhaeltnisse 
N/T T/N 
0.0847 11.7998 
1.7013 0.5878 
0.1854 5.3946 
undef 0.0000 
0.0219 45.6387 
0.0000 undef 
0.1374 7.2801 
undef 0.0000 
undef 0.0000 
undef 0.0000 
undef 0.0000 
undef undef 
9.6527 0.1036 
•0.0000 undef 
0.0260 38.5221 
1.5861 0.6305 
undef 0.0000 
undef undef 
0.0000 undef 
undef 0.0000 
0.0000 undef 
undef 0.0000 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoet is ch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit' 

0.0417 

0.1361 

0.0063 

0.1337 

0.0000 

0.3380 

0.0249 

0.0578 

0.5071 

0.2594 

0.4120 

0.0000 

0.0000. 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0612 
Eierstock_n 0.0000 
Eierstock_t 0.0000 : ; 
EndokrinesjGewebe 0.0000 
Foetai 0.1188 
Gastrointestinal 0.0244 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0154 
Lunge 0.0000 
Nerven 0.0141 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0416 
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Elektronisctaer Northern fur SEQ. ID. NO: 20 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesjGewebe 
Gastrointestinal 
Gehirn 
Haematopoet i sch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus_Myometrium 
0terus_ allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
30 Wei sse_Blutkoerper chen 
Zervix 



10 



15 



20 



25 



NORMAL 


TUMOR 


Verhaeltnisse 


%Haeuf igkeit %Haeufigkeit N/T T/N 


0.0039 


0.0383 


0.1017 9.8332 


0.0077 


0.0075 


1.0208 0.9796 


0.0061 


0.0496 


0.1236 8.0920 


0.0000 


0.0130 


0.0000 undef 


0.0153 


0.0000 


undef 0.0000 


0.0115 


0.0000 


undef 0 . 0000 


0.0022 


0.0031 


0.7200 1.3890 


0.0013 


0.0000 


undef 0.0000 


0.0073 


0.0000 


undef 0.0000 


0.0048 


0.0129 


0.3676 2.7200 


0.0000 


0.0000 


undef undef 


0.0000 


0.0000 


undef undef 


0.0042 


0.0102 


ft A r\ C A O A CftC 

0 . 4064 Z« 4oUO 


0.0290 


0.0077 


3-7816 0.2644 


0.0000 


0.0000 


undef undef 


0.0136 


0.0000 


undef 0.0000 


0.0017 


0.0000 


undef 0.0000 


0.0030 


0.0000 


undef 0.0000 


0.0065 


0.0106 


0.6142 1.6282 


0.0068 


0.0528 


0.1280 7.8106 


0.0000 


0.0000 


undef undef 


0.0000 


0.0000 


undef undef 



0.0032 
0.0268 
0.0267 
0.0235 
0.0000 
0.0319 



FOETUS 
%Haeuf igkeit 
Entwicklung 0.0139 
Gastrointenstinal 0.0139 
Gehirn 0.0000 
Haematopoet isch 0 . 0039 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgef aesse 0.0000 
Lunge 0.0036 
Nebenniere 0-0254 
r Niere 0.0062 

Placenta 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000. 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0204 
Eierstock ja 0-0000 
EierstockT_t 0-0203 ; 
EndokrinesjGewebe 0.0245 
Foetal 0.0128 
Gastrointestinal 0.0122 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0154 
Lunge 0.0082 
Nerven 0.0090 
Prostata 0.0068 
Sinnesorgane 0.0000 
Uterus n 0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 21 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Blase 


U. 


\J • \J-J>Jv 


0.1090 9.1777 


Brust 


0. 0077 


u • uzu / 




T>nonnH a Y~ro 

LJ UCI Ik 1UQ ■!- HI 


ft Afti c 

0, 0215 


ft 

U. UJ-5J. 


0 K4RR 1 5413 




0. 00.5 U 


ft nmo 
u . uu / o 


0 *}fl^R 9 fiOSR 

Uf JOJO AtUU^Q 


juiaojcrine s^beweoc 


f\ nnnn 
u • uuuu 


n 0000 

\J m VwW 


unde f unde f 


/2a e*"TV™ii n^oct*i Vial 


ft ft Cl c 

0-0575 


ft r\r\A £ 


12 4251 


Gehim 


0.0000 


0. UU/Z 


U . UUUU UQttcl. 


naeinatopoetioi-ii 


0.0214 


ft ft AAA 

0. UUUU 




UaUL. 


0.0037 


ft nnnn 
U • UUUU 


\*r\/ia& ft nnnn 

liTii ih r U • UUUU 




ft ft ft ft A 

0 • ouuu 


ft ftt a a 
U ■ UJ.J9 


0 nOOO nndpf 


H63TZ 


ft ft Ann 


ft m 
u . u±o / 


0 0000 nnripf 




U • UUUU 


o n=iRS 

U • UJOf 


0.0000 undef 




n 0104 


0.0225 


0.4618 2.1652 


M*mf»n— Soeiseroelire 


0 0000 


0.0000 


undef undef 


Muskel-Skelett 


0.0257 


0.0120 


'2.1416 0.4669 


Niere 


n nn^A 


0 0000 

IF . U VVU 


undef 0.0000 


Pankreas 


0. 0099 


0.0221 


0.4487 2.2286 


Penis 


n nnnn 


0 0000 


undef undef 


Prostata 


0.0022 


0.0000 


undef 0.0000 


Uterus Endome t r ium 


U • UUUU 


n nnnn 


LI 1 In. 4 w JL HI l%*w *• 


Uterus_Myamet r i urn 


0.0000 


0.0000 


undef undef 


ntonic all rr^Tn^ i T\ 


0.0000 


ft nnnn 
0. UUUU 


unuex unuei. 


Brust-Hyperplasie 


ft ft.no.st 

0. UU7D 






Pros tata—uyperp las ie 


ft Annn 

u • UUUU 






^ QUI ICI1JU « 


ft nnnn 
0. UUUU 






^■i nnesoroane 

If 1 IT A> W 


SV ftft ft ft 

0. 0000 








ft ft ft ft ft 






7.0TT71 V 


0.0000 








FOETUS 








%Haeufigkeit 






£tULnlL>Jki Uliy 


0.0000 






Gastrointenstinal 


0.0000 








A ft ft ft ft 

0.0000 






tiaenia t opoe i.x s t*u 


ft nnnn 

u • UUUU 






Haut 


0.0000 






Hepatisch. 


0.0000 






Herz-Blutgefaesse 


0.0000 






Lunge 


0.0000 






Nebenniere 


0.0000 






Niere 


0.0000 






Placenta 


0.0000 






Prostata 


0.0000 






Sinnesorgane 


0.0000. 







IS 



20 



30 



35 



40 



45 



Brust 
Eierstockjn 
Eierstock[_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0136 

0.0000 

0.0051 ; 

0.0000 

0.0000 

0.0122 

0.0000 

0.0000 

0.0000 

0.0164 

0.0000 

0.0000 

0.0000 

0.0000 



50 



55 



60 



65 



25 



DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. ID. NO: 22 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesjGewebe 
Gas trointest inal 
Gehirn 
Haema topoetis ch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
' Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplas ie 
Pros tata-Hyperplas ie 
Samenblase 
Sinnesorgane 
30 Weiss e_Blutkoerperchen 
Zervix 



10 



15 



20 



25 



NORMAL 

%Haeufigkeit 

0.0351 

0.0281 

0.0092 

0.0569 

0.0596 

0.0019 

0.0022 

0.0174 

0.0220 

0.0285 

0.0223 

0.0173 

0.0145 

0.0290 

0.0788 

0.0489 

0.0264 

0.0090 

0.0000 

0.1013 

0.0381 

0.0153 

0.0096 

0.0059 

0.0000 

0.0118 

0.0000 

0.0000 



TUMOR 

%Haeufigkeit 
0.3144 
0.0470 
0.0662 
0.0208 
0.0527 
0.0139 
0.0945 
0.0000 
0.5085 
0.0582 
0.0000 
0.0000 
0.0082 
0.0077 
0.0540 
0.0137 
.0442 
,1066 
.0064 
.0000 
0.1494 
0.0954 



Verhaeltnisse 
N/T T/N 
0.1116 8.9591 
0.5989 1.6698 
0.1390 7.1929 
2.7342 0.3657 
1.1321 0.8833 
0.13817.2434 
0.0235 42.5950 
undef 0.0000 
0.0433 23.0839 
0.4902 2.0400 
undef 0.0000 
undef 0.0000 
1.7781 0.5624 
3.7816 0.2644 
1.4595 0.6852 
3.5687 0.2802 
0.5983 1.6714 
0.0842 11.8713 
0.0000 undef 
undef 0.0000 
0.2551 3.9206 
0.1601 6-2452 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit- 

0.0696 

0.4554 

0.0000 

0.2753 

0.0000 

1.6381 

0.0605 

0.1879 

1.1663 

0.8215 

0.7816 

0.0499 

0.0000. 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0612 

Eierstock_n 0.0000 

Eierstock^_t 0.1164 

Endokrines_Gewebe 0.0000 

Foetal 0.4665 

Gastrointestinal 0.0000 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0000 

Nerven 0.0030 

Prostata 0.0068 

Sinnesorgane 0.0000 

Uterus n 0.0291 
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Hektronischer Northern fur SEQ. ID. NO: 23 



Blase 
Brust 
Duenndaxm 
Eierstock 
Endokrines^Gewebe 
Gastrointestinal 
Gehirn 
Haexnatopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehxe 
* Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Dterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 
%Raeufigkeit 
0.0039 
0.0038 
0.0184 
0.0150 
0.0170 
0.0192 
0.0133 
0.0174 
0.0220 
0.0048 
0.0339 
0.0288 
0.0218 
.0000 
.0291 
.0136 
.0050 
.0120 
.0153 
0.0203 
0.0000 
0.0051 
0.0128 
0.0178 
0.0267 
0.0118 
0.0243 
0.0213 



TOMOR 

%Haeufigkeit 

0.0332 

0.0263 

0.0331 

0.0156 

0.0075 

0.0324 

0.0113 

0.0000 

0.0000 

0.0388 

0.0137 

0.0234 

0.0184 

0.0307 

0.0240 

0.0137 

0.0166 

0.0000 

0.0149 

0.0000 

0.0204 

0.0000 



Verhaeltnisse 
N/T T/N 
0.1173 8.5221 
0.1458 6.8574 
0.5561 1.7982 
0.9594 1.0423 
2.2642 0.4417 
0.5917 1.6901 
1.1781 0.8488 
undef 0.0000 
undef 0.0000 
0.1225 8.1599 
2.4671 0.4053 
1.2299 0.8130 
1.1854 0.8436 
.0.0000 undef 
1. 2136 0.8240 
0.9913 1.0088 
0.2991 3.3428 
undef 0.0000 
1.0236 0.9769 
undef 0.0000 
0.0000 undef 
undef 0.0000 



FOETUS 

%Haeufigkeit 
Entwicklung 0.0139 
Gastrointenstinal 0.0056 
Gehirn 0.0125 
Haematopoetisch 0.0000 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0320 
Lunge 0.0036 
Nebenniere 0.0254 
Niere 0.0124 
Placenta 0.0121 
Prostata 0.0000 
Sinnesorgane 0.0000- 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0204 
Eierstock_n 0.0000 
EierstockTt 0.0354 
Endokrines_Gewehe 0.0000 
Foetal 0.0105 
Gastrointestinal 0.0000 
Haematopoetisch 0.0171 
Haut-Muskel 0.0454 
Hoden 0.0000 
Lunge 0.0246 
Nerven 0.0211 
Prostata 0.0000 
Sinnesorgane 0 . 0000 
Uterus n 0.0000 
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Elektronischer Northern ffir SEQ. ID. NO: 24 



5 



10 



20 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0,0195 


0.1457 


0.1338 


7.4732 


Brust 


0.0013 


0.0169 


. 0.0756 


13.2250 


Duenndarm 


0.0061 


0.0496 


0.1236 


8.0920 


Eierstock 


0.0090 


0.0260 


0.3454 


2.8954 


Endokrines GeweJbe 


0.0119 


0.0125 


0.9509 


1.0516 


Gastrointestinal 


0.1648 


0.0231 


7 . 1237 


0.1404 


Gehirn 


0.0030 


0 0308 


0.0960 


10.4173 


Haenta t opoe t i s ch 


0.2099 


0 0000 


un.de f 


0.0000 


Haut 


0.0000 


0 0000 


under! 


undef 


Hena t i s rh 


0.0000 


0 2006 


0.0000 


undef 


Herz 


0.0032 


0.0137 


0.2313 


4.3235 


Hoden 


0.0000 


0.0819 


0.0000 


undef 


Lunge 


0.0956 


0.2965 


0.3223 


3.1023 


Mag en~ Spei s eroehre 


0.0097 


0.1917 


0.0504 


19.8329 


*" Muskel-Skelett 


0.0959 


0.0240 


3.9977 


0.2501 


Niere 


0.0081 


0.0205 


0.3965 


2.5219 


Pankreas 


0.0182 


0.0497 


0.3656 


2.7350 


Penis 


0.0030 


0.1333 


0.0225 


44.5175 


Prostata 


0.0131 


0.0064 


2.0473 


0.4885 


Oterus_Endometrium 


0.0000 


0.0000 


undef 


undef 


Uterus_Myometrium 


0.0000 


0.0000 


undef 


undef 


Dt e rus_ allgemein 


0.0051 


0.0000 


undef 


0.0000 


Brust-Hyperplasie 


0.0096 









Prostata-Hyperplasie 0 . 0030 

Samenblase 0.0267 

Sinnesorgane 0.0706 

30 Weisse_Blutkoerperchen 0.0017 

Zervix 0.0000 



FOETUS 

35 %Haeufigkeit 

Entwicklung 0.0000 

Gastrointenstinal 0.0000 

Gehirn 0.0000 

Haematopoetisch 0.0000 

40 Haut 0.0000 

Hepatisch 0.0000 

Herz-Blutgef aesse 0 . 0000 

Lunge 0.0000 

Nebenniere 0.0000 

45 Niere 0.0000 

Placenta 0.0000 

Prostata 0.0000 

Sinnesorgane 0 . 0000.. 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeuf igkeit 
Brust 0.0068 
Eierstock_ n 0.0000 
EierstockM: 0.2076 
Endokrines_Gewebe 0.0000 
Foetal 0.0006 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0082 
Nerven 0.0030 
Prostata 0.0068 
Sinnesorgane 0.0000 
Uterus n 0.0042 
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Hektronischer Northern fur SEQ. ID. NO: 25 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
0terus_Endometriura 
Uterus^Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 
%Haeuf igkeit 
0.0039 
0.0217 
0.0123 
0000 
0102 
0115 
0148 
0.0107 
0.0220 
0.0000 
0.0085 
0.0058 
0.0145 
0.0000 
0.0086 
0.0081 
0.0050 
0.0030 
0.0109 
0-0203 
0.0076 
0.0102 
0.0096 
0.0059 
0.0089 
0.0000 
0.0286 
0.0106 



TUMOR 

%Haeufigkeit 

0.0281 

0.0132 

0.0000 

0.0026 

0.0150 

0.0046 

0.0154 

0.0000 

0.0847 

0.0000 

0.0000 

0.0117 

0.0061 

0.0077 

0.0360 

0.0000 

0.0166 

0.0000 

0.0064 

0.0000 

0.0068 

0.0000 



Verhaeltnisse 
N/T T/N 
0.1387 7.2110 
1.6527 0.6051 
undef 0.0000 
0.0000 undef 
0.6792 1.4722 
2.4850 0.4024 
0.9599 1.0417 
undef 0.0000 
0.2599 3.8473 
undef undef 
undef 0.0000 
0.4920 2.0326 
2.3708 0.4218 
0.0000 undef 
6.2380 4.2024 
undef 0.0000 
0.2991 3.3428 
undef 0.0000 
1.7060 0.5862 
undef 0.0000 
1.1223 0.8911 
undef 0.0000 



:• FOETUS 

%Haeuf igkeit- 
Entwicklung 0.0000 
Gastrointenstinal 0.0028 
Gehirn 0.0125 
Haematopoetisch 0.0000 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgef aesse 0 . 0000 
Lunge 0.0036 
Kebenniere 0.0000 
Niere 0.0000 
Placenta 0.0061 
Prostata 0.0000 
Sinnesorgane 0.0000, 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0204 
Eierstockjn 0.0000 
EierstockM; 0.0051 
EndokrinesjGewebe 0.0000 
Foetal 0.0029 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0000 
Nerven 0.0010 
Prostata 0.0068 
Sinnesorgane 0.0000 
Uterus n 0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 26 



10 



15 



20 



25 



30 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeuf igkeit %Haeufigkeit N/T T/N 


Blase 


0.0039 


.0.0256 


0.1525 6.5555 


Brust 


0.0038 


0.0056 


0.6805 1.4694 


Duenndarm 


0.0031 


0.0000 


undef 0.0000 


Eierstock 


0.0210 


0.0052 


4.0294 0.2482 


Endokrines_Gewebe 


0.0017 


0.0000 


undef 0.0000 


Gastrointestinal 


0.0057 


0.0000 


undef 0.0000 


Gehirn 


0.0037 


0.0031 


1.1999 0.8334 


Haematopoetisch 


0.0040 


0.0000 


undef 0.0000 


Haut 


0.0073 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.0000 


undef undef 


Herz 


0.0000 


0.0000 


undef undef 


Hoden 


0.0000 


0.0000 


undef undef 


Lunge 


0-0073 


0.0020 


3.5562 0.2812 


Magen-Speiseroehre 


0.0000 


0.0000 


undef undef 


Muskel-Skelett 


0.0069 


0.0000 


undef 0.0000 


Niere 


0.0027 


0.0000 


undef 0.0000 


Pankreas 


0.0017 


0.0000 


undef 0.0000 


Penis 


0.0030 


0.0000 


undef 0.0000 


Prostata 


0.0000 


0.0021 


0.0000 undef 


Uterus_Endometrium 


0.0000 


0.0000 


undef undef 


Uterus_Mycmetrium 


0.0076 


0.0068 


1.1223 0.8911 


Uterus_allgemein 


0.0102 


0.0000 


undef 0.000C 


Brust-Hyperplasie 


0.0000 







Prostata-Hyperplasie 0.0030 

Samenblase 0.0089 

Sinnesorgane 0.0000 

Weisse_Blutkoerperchen 0.0121 

Zervix 0.0000 



FOETUS 

35 %Haeufigkeit 

Entwicklung 0.0000 

Gastrointenstinal 0.0139 

Gehirn 0.0063 

Haematopoetisch 0.0039 

40 Haut 0.0000 

Hepatisch 0.0000 

Herz-Blutgefaesse 0.0036 

Lunge 0.0036 

Nebenniere 0.0000 

45 * Niere 0.0062 

Placenta 0.0000 

Prostata 0.0000 

Sinnesorgane 0.0000 t 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeuf igkeit 

Brust 0.0000 

Eierstockja 0.0000 

EierstockM: 0.0101 

Endokxines_Gewebe 0-0000 

Foetal 0.0029 

Gastrointestinal 0.0488 

Haematopoetisch 0.0114 

Haut-Muskel 0-0130 

Hoden 0.0154 

Lunge 0.0000 

Nerven 0.0020 

Prostata 0.0274 

Sinnesorgane 0 . 0155 

Uterus n 0.0083 
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Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehim 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen- Spe is eroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerpercheh 
Zervix 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Hebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



Brust 
Eierstockja 
Eierstock]_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
= : Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Oterus n 



Hektronischer Northern fur SEQ. ID. NO: 27 



NORMAL 


TUMOR 


Verhaeltnisse 


%Haeufigkeit %Haeufigkeit N/T T/N 


0.0039 


0.0256 


0.1525 6.5555 


0.0115 


0.0150 


0.76561.3062 


0.0307 


0.0000 


undef 0.0000 


0.0150 


0.0052 


2.8781 0.3474 


0.0119 


0.0150 


0.7925 1.2619 


0.0057 


0.0139 


0.4142 2.4145 


0.0074 


0.0092 


0.8000 1.2501 


0.0147 


0.0000 


undef 0.0000 


0.0037 


0.1695 


0.0217 46.1678 


0 0048 


0 0000 


undef 0.0000 


0.0074 


0.0137 


0.5397 1.8529 


n noon 


0.0234 


0 • 0000 undef 


0. 0104 


0.0082 


1.2701 0.7873 


0.0000 


0.0230 


0.0000 undef 


0.0086 


0.0000 


undef 0.0000 


0.0190 


0.0000 


undef 0.0000 


0.0066 


0.0055 


1.1966 0.8357 


0.0150 


0.0267 


0.5616 1.7807 


0.0087 


0.0149 


0.5849 1.7096 


0.0203 


0.0000 


undef 0.0000 


0.0000 


0.0136 


0.0000 undef 


0.0051 


0.0000 


undef 0.0000 



0.0064 
0.0178 
0.0178 
0.0235 
0.0104 
0.0000 



FOETUS 

%Haeufigkeit' 

0.0000 

0.0111 

0.0125 

0.0000 

0.0000 

0.0260 

0.0071 

0.0217 

0.0254 

0.0062 

0.0061 

0.0000 

0.0000. 



NORMIERTE/SDBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0136 

0.0000 

0.0101 

0.0245 

0.0181 

0.0244 

0.0114 

0.0065 

0.0000 

0.0164 

0.0090 

0.0068 

0.0232 

0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 28 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehim 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus_Myametrium 
Uterus — allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
30 Weisse_Blutkoerperchen 
Zervix 



10 



15 



20 



25 



NORMAL 


TUMOR 


Verhaeltnisse 


%Haeufigkeit %Haeufigkeit N/T T/N 


0.0312 


0.1968 


0.1585 6.3096 


0.0281 


0.0526 


0.5347 1.8702 


0.2177 


0.0000 


undef 0.0000 


0.0000 


0.0494 


0.0000 undef 


0.0017 


0.0000 


undef 0.0000 


0.2778 


0.0324 


8.5792 0.1166 


0.0007 


• 0.0133 


0.0554 18.0566 


0.1470 


0.0000 


undef 0.0000 


0.0551 


0.0000 


undef 0.0000 


0.0000 


0.1876 


0.0000 undef 


0.0011 


0.0137 


0.0771 12.9706 


0.0000 


0.2339 


0.0000 undef 


0.0623 


0.1206 


0.5166 1.9356 


0.0000 


0.0153 


0.0000 undef 


0.1216 


0.0300 


4.0548 0.2466 


0.0081 


0.1027 


0.0793 12.6097 


0.0743 


0.2209 


0.3365 2.9714 


0.0000 


0.0000 


undef undef 


0.0065 


0.0149 


0.4387 2.2795 


0.0068 


0.0000 


undef 0.0000 


0.0000 


0.0000 


undef undef 


0.0000 


0.0000 


undef undef 



0.0384 
0.0030 
0.0000 
0.0118 
0.0017 
0.0000 



FOETUS 

35 %Haeufigkeit. 

Entwicklung 0.0000 

Gastrointenstinal 0.0000 

Gehirn 0.0000 

Haematopoetisch 0.0079 

40 Haut 0.0000 

Hepatisch 0.0000 

Herz-Blutgef aesse 0 . 0000 

Lunge 0.0000 

Nebenniere 0.0000 

45 , Niere 0.0000 

Placenta 0.0000 

Prostata 0.0000 

Sinnesorgane 0.0000 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0068 

Eierstock_n 0.0000 

Eierstock^t 0.0051 

Endokrines_Gewebe 0.0000 ~ 

Foetal 0.0000 

Gastrointestinal 0.0000 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0082 

N erven 0.0000 

Prostata 0.0000 

Sinnesorgane 0.0000 

Uterus n 0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 29 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Blase 


0.0273 


0.1713 


0.1594 6.2745 


Brust 


0.0217 


0.0395 


0.5509 1.8152 


Duenndarm 


0.1901 


0.0000 


undef 0.0000 


Eier stock 


0.0000 


0.0390 


0.0000 undef 


Endokrines Gewebe 


0.0017 


0.0000 


undef 0.0000 


Gastrointestinal 




0.0278 


8.9737 0.1114 


Gehirn 


o nono 


0.0113 


0 . 0000 undef 


OQCJIIrl LW^vv wawwaa 




0 0000 

V » W V V 


undef 0 . 0000 


Haut 


n 04*77 


0 0000 


undef 0.0000 


Hpoatisch 

HC^/B \m III *it\rt 1 


o nnnn 


0.1423 


0.0000 undef 


Herz 


0. 0032 


0.0000 


undef 0 . 0000 


Hoden 


0 0000 


0.2105 


0.0000 undef 


Lunge 


0.0457 


0.0818 


0.5588 1.7894 


Magen— Speiseroehre 


0.0000 


0.0153 


p. 0000 undef 


' Muskel-Skelett 


0.0822 


0.0240 


3.4266 0.2916 


Niere 


0.0054 


0.1027 


0.0529 18.9146 


Pankreas 


0.0694 


0.2430 


0.2855 3.5020 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0153 


0.0106 


1.4331 0.6978 


Uterus_Endometrium 


0.0068 


0.0000 


undef 0.0000 


Oterus_Myometri\im 


0.0000 


0.0000 


undef undef 


Uterus allgemein 


0 0000 


0.0000 


undef undef 


Brust -Hype rplasie 


0.0288 






Prn^fata-flvnerolasie 


0.0030 






Samenblase 


0 0000 

V * M W V 






S inneso rgane 


o ni i a 






w*» isse Rlut ko erx>e r cheii 


0 0009 






Zervix 


n nnnn 








FOETUS 














Entwi ckl ung 


n nnnn 






Gas t roin tens tinal 


n 0000' 






Gebirn 


o 0000 






Haematopoetisch 


0.0079 






Haut 


0.0000 






Hepatisch 


0.0000 






Herz-Blutgefaesse 


0.0000 






Lunge 


0.0000 






Nebenniere 


0.0000 






Niere 


0.0000 






Placenta 


0.0000 






Prostata 


O.0000 






Sinnesorgane 


0.0000 , 







NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeuf igkeit 
Brust 0.0068 
Eierstock_n 0.0000 
BierstocK^t 0.0000 
Endokrines_ Gewebe 0.0000 
Foetal 0.0000 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
— Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0082 
Nerven 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 
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Elektronischer Northern fur SEQ. ED. NO: 30 



5 



10 



15 



20 



45 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Bla.se 


n fiasfi 

U • UOJO 


0*4525 


0.1896 5.2742 


Brust 


a 1 OTQ 

0. 1^ /y 


A AQ77 


1 3087 0 7641 


Duenndann 


A JQQQ 


a 01 50 


2 3242 0 4302 


EilCiO LUtJV 


a ni on 

U • UJ.OU 


n 1353 


0 1328 7.5280 


r«nu.OKXXiies oeweuc 


0 0307 


0.0176 


1.7466 0.5725 


uaSLrOllltco U illdX 


fl T 4 Oil 




5 5413 0 1805 


Cphirn 


0. 0030 






Haemstopoetiscri 


0. 6884 


a nnnn 
0. 0000 


itviftA^ A A A A A 

unaer u.uuuu 


Haut 


0.0551 


0.1695 


A OO/IO O AT70. 


Hepatiscn 


0.0381 


0.7635 


A Ail OO OA nCQD 

u.u^yy £u.uj7o 


Herz 


0.0095 


0.0412 


a 0^1 a "30^5 

U.£.3X<3 4* J^OO 


Hoden 


/\ a Ann 

0. 0000 


n i ii no 




Lunge 


U • j£ / / J 


n 541 ft 


0 5119 1 9536 


Magen— Spe is e roehre 


u. uo / D 


fl 3450 


0.19615.0999 






U * UODU 


4 8544 0 2060 


Niere 


n ai eo 


n 1 7fl0 

U.l'OU 


0 0915 10 9284 




u • UO^J 


0 2319 


0.2279 4.3875 


Penis 


n ofloo 


0 0000 


unde f unde f 


Prostata 


0.0828 


0.0192 


4.3220 0.2314 


nt^i-w«* Endometrium 


a nnco 
0. OOoo 


a nnnrt 
u. uuuu 


n«H^f O 0000 


Uterus_Myometrium 


0.0076 


0.0204 


0.37412.6732 


Uterus allgemein 


0. 0102 


n aaaa 
U • UUUU 


imHof n nnno 


Brust-Hyperplasie 


A "1 Oil O 
U. 






Prostata— Hype rplasie 


a A9£Ct 






JolllcilUXoo ts 


n non 
0. Uzb / 








a no co 
0. 0353 






wexsse DiUuKoeipercxicu 


n nn it O 






Zsrvix 


0. 0852 








FOETUS 








%Haeuf igkeit 






Entwxcklung 


0.0000 






Gastrointenstinal 


0.0000 






Gehirn 


0.0000 






Haematopoetisch 


A ni T O 






Haut 


0.0000 








0.0000 






Herz-Blutgefaesse 


0.0000 






Lunge 


0.0000 






Nebenniere 


0.0000 






Niere 


0.0000 






Placenta 


0.0000 






Prostata 


0.0000 






Sinnesorgane 


0.0000 







NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.1837 

Eierstock_n 0.0000 

EierstockM; 0.0405 

EndokrinesjGewebe 0.0000 

Foetal 0.0023 

Gastrointestinal 0.0976 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0000 

Nerven 0.0020 

Prostata 0.0000 

Sinnesorgane 0.0000 

Uterus n 0.0042 
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Hektronischer Northern fur SEQ. ID. NO: 31 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetis ch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
• Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endametrium 
Uterus_Myometriuia 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 


TUMOR 


Verhaeltnisse 


%Haeufigkeit %Haeufigkeit N/T T/N 


0 0039 


0 0204 


0.1907 5.2444 


n no77 


0 0056 


1.3611 0.7347 


n mfl4 

\J • UlOi 


0.0331 


0.5561 1.7982 


0 0150 


0.0104 


1.4391 0.6949 


0.0102 


0.0000 


undef 0.0000 


0*0019 


0.0046 


0.4142 2.4145 


0 0074 


' 0.0072 


1.0285 0.9723 


0 0067 


0 0000 


undef 0.0000 


n ni 47 


0 0000 


undef 0.0000 


n nnnn 
u . uuuu 


0. 0065 


0.0000 undef 


ft nn£4 


0 0000 


undef 0.0000 


u. uo^o 


ri 0534 


1 4759 0. 6775 


0 0073 


0.0143 


0.5080 1.9684 


0.0000 


0.0000 


undef undef 


0.0034 


0.0000 


-undef 0.0000 


0.0054 


0.0000 


undef 0.0000 


0.0050 


0.0055 


0.8974 1.1143 


0.0030 


0.0000 


undef 0.0000 


0.0153 


0.0043 


3.5827 0.2791 


0.0068 


0.0000 


undef 0.0000 


0.0000 


0.0068 


0.0000 undef 


0.0000 


0.0000 


undef undef 



0.0000 
0.0089 
0.0178 
0.0000 
0.0069 
0.0106 



FOETUS 

%Haeufigkeit. 
Entwicklung 0.0000 
Gastrointenstinal 0.0111 
Gehirn 0.0000 
Haexnatopoetisch 0.0039 
Haut 0.2513 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0071 
Lunge 0.0036 
Nebenniere 0.0000 
Niere 0.0062 
Placenta 0.0061 
Prostata 0.0249 
Sinnesorgane 0 . 0000 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0136 

Eierstock_n 0.0000 

Eierstocfe^t 0.0152 

EndokrinesJ3ewebe 0.0000 

Foetal 0.0041 

Gastrointestinal 0.0122 

Haematopoetisch 0.0000 

Haut-Muskel 0.0130 

Hoden 0.0077 

Lunge 0.0000 

Nerven 0.0040 

Prostata 0.0137 

Sinnesorgane 0 . 0000 

1 Uterus n 0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 32 



10 



15 



20 



25 



30 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatiscn 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endametrium 
Uterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 

%Haeufigkeit 

0.0039 

0.0141 

0.0061 

0.0090 

0.0153 

0.0192 

0.0222 

0.0094 

0.0110 

0.0048 

0.0159 

0.0000 

0.0135 

0.0097 

0.0188 

0.0163 

0.0083 

0.0090 

0.0109 

0.0068 

0.0076 

0.0102 

0.0032 

0.0119 

0.0089 

0.0235 

0.0087 

0.0000 



TDMOR 

%Haeufigkeit 

0.0204 

0.0282 

0.0000 

0.0312 

0.0176 

0,0231 

0.0113 

0.0000 

0.0000 

0.0129 

0.0000 

0.0234 

0.0123 

0.0153 

0.0300 

0.0205 

0.0221 

0.0267 

0.0043 

0.0000 

0.0204 

0.0954 



Verhaeltnisse 
N/T T/N 
0.1907 5.2444 
0.4991 2.0038 
undef 0.0000 
0.2878 3.4745 
0.8733 1.1451 
0.8283 1.2072 
1.9635 0.5093 
undef 0.0000 
undef 0.0000 
0.3676 2.7200 
undef 0.0000 
0.0000 undef 
1.1007 0.9085 
0.6303 1.5866 
0.6282 1.5918 
0.7930 1.2610 
0.3739 2.6743 
0.3369 2.9678 
2.5591 0.3908 
undef 0.0000 
0.3741 2.6732 
0.1067 9.3678 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatiscn 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
: Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0111 

0.0000 

0.0039 

0.0000 

0.0260 

0.0107 

0.0072 

0.0000 

0.0124 

0.0000 

0.0000 

0.0000i 



50 



55 



60 



65 



Brust 
Eierstock n 
Eierstock^t 
EndokrinesjGewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 

Haut-Muskel 0.0032 
Hoden 0.0154 
Lunge 0.0164 
Nerven 0.0060 
Prostata 0.0068 
Sinnesorgane 0 . 0000 
Uterus n 0.0083 



NORMIERTE/SUSTRAHIERTE bibliotheken 
%Haeufigkeit 
0.0000 
0.1595 

0.0203 ; 
0.0000 
0.0058 
0.0488 
0.0114 
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Elektronischer Northern fur SEQ. ID. NO: 33 

NORMAL 
%Haeufigkeit 
Blase 0.0429 
Brust 0,0409 
Duenndarm 0.1870 
Eierstock 0.0030 
Endokrines^Gewebe 0 • 0034 
Gastrointestinal 0.2740 
Gehirn 0.0007 
Haematopoetisch 0.2165 
Haut 0.0220 
Hepatisch 0.0095 
Herz 0.0074 
Hoden 0.0000 
Lunge 0.1039 
Magen-Speiseroehre 0*0387 
' Muskel-Skelett 0.0994 
Miere 0.0054 
Pankreas 0.0231 
Penis 0.0000 
Prostata 0.0262 
Uterus_Endometrium 0.0068 
Uterus_Myometrium 0.0076 
Uterus_allgemein 0.0051 
Brust-Hyperplasie 0.0512 
Prostata-Hyperplasie 0 . 0178 
Samenblase 0.0089 
S innesorgane 0.0118 
Weisse_Blutkoerperchen 0*0000 
Zervix 0.0319 



FOETUS 
%Haeufigkeit* 

Entwicklung 0 . 0000 

Gastrointenstinal 0.0000 

Gehirn 0.0000 

Haematopoetisch 0. 0079 

Haut 0.0000 

Hepatisch 0.0000 

Herz-Blutgef aesse 0 . 0000 

Lunge 0.0000 

Nebenniere 0.0000 

Niere 0.0000 

* Placenta 0.0000 

Prostata 0-0000 

Sinnesorgane 0 . 0000 



NORMIERTE / SOBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0612 

Eierstock ja 0.0000 

EierstockM: 0.0152 

Endokrines_Gewebe 0.0000 

Foetal 0.0012 

Gastrointestinal 0.0366 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0000 

Nerven 0.0000 

Prostata 0.0000 

Sinnesorgane 0 . 0000 

Uterus n 0.0000 



TUMOR 


Verhaeltnisse 


%Haeufigkeit N/T T/N 


0 71 73 


0 1974 5.0656 


0 0357 


1.1462 0.8725 




2 8269 0.3537 


0 0R33 


0.0360 27.7957 


o nnso 


0 6792 1.4722 


0*0740 


3.7016 0.2702 


n 091 


0 0343 29.1683 


n nnnn 


undef 0.0000 


n hnnn 


iinripf 0.0000 


n **^9Q 

U ■ J?«7 


0 0277 36 0397 


u. u*£ /*> 


0 26Q8 3 7059 




u . w w m iViiw^ 


U * X /JO 


0.5977 1.6731 


0.0920 


0.4202 2.3799 


0.0420 


2.3660 0.4227 


0.0753 


0.0721 13.8707 


0.1160 


0.1994 5.0142 


0.0000 


undef undef 


0.0000 


undef 0.0000 


0.0000 


undef 0.0000 


0.0000 


undef 0.0000 


0.0000 


undef 0.0000 
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Hektronischer Northern fur SEQ. ID. NO: 34 



10 



15 



20 



25 



30 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesjGewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
- Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_ Endometrium 
Uterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
We is se_Blutkoerperchen 
Zervix 



NORMAL 
%Haeufigkeit 
0.0117 
0.0102 
0.0153 
0.0539 
0.0187 
0.0134 
0.0133 
0.0134 
0.0147 
0.0000 
O.0170 
0.0173 
0.0062 
0.0676 
0.0171 
0.0109 
0.0066 
0.0269 
0.0327 
0.0068 
0.0534 
0.0306 
0.0000 
0.0030 
0.0000 
0.0118 
0.0061 
0.0213 



TUMOR 

%Haeufigkeit 
0.0588 
0.0094 
0.0331 
0.0130 
0.0075 
0.0093 
0.0123 
0.0000 
0.0000 
.0129 
.0137 
.0234 
.0020 
.0000 
.0180 
0.0274 
0.0110 
0.0533 
0.0213 
0.0000 
0.0000 
0.0000 



Verhaeltnisse 
N/T T/N 
0.1990 5.0259 
1.0888 0.9184 
0.4634 2.1579 
4.1445 0.2413 
2.4906 0.4015 
1.4496 0.6898 
1.0799 0.9260 
undef 0.0000 
undef 0.0000 
0.0000 undef 
1.2336 0.8107 
0.7380 1.3551 
3.0482 0.3281 
undef 0.0000 
0.9518 1.0506 
0.3965 2.5219 
0.5983 1.6714 
0.5054 1.9786 
1.5354 0.6513 
undef 0.0000 
undef 0.0000 
undef 0.0000 



FOETUS 

%Haeufigkeit. 

Entwicklung 0.0139 

Gastrointenstinal 0 . 0555 

Gehirn 0.0000 

Haematopoetisch 0.0079 

Haut 0.0000 ;; 

Hepatisch 0.0260 

Herz-Blutgefaesse 0.0071 

Lunge 0.0253 

Nebenniere 0.0254 

Niere 0.0000 

Placenta 0.0485 

Prostata 0.0000 

Sinnesorgane 0 . 0000 

NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0204 
Eierstockja 0.0000 
Eierstockj: 0.2430 
Endokrines_Gewebe 0.0245 
Foetal 0.0338 
Gastrointestinal 0.0122 
Haematopoetisch 0.0171 
Haut-Muskel 0.0680 
Hoden 0.0077 
Lunge 0.0000 
Nerven 0.0151 
Prostata 0.0342 
Sinnesorgane 0. 0000 
Uterus n 0.1166 
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DE 198 18 619 A 1 

Hektronischer Northern fur SEQ. ID. NO: 35 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Blase 


0.0273 


0.1227 


0.2225 4.4952 


Brust 


0.0243 


0.0263 


0.9236 1.0828 


Duenndarm 


0.1625 


0.0000 


undef 0.0000 


Eierstock 


0.0000 


0.0364 


0.0000 undef 


Endokrines Gewebe 


0.0000 


0.0000 


undef undef 


Gastrointestinal 


0.2146 


0.0093 


23.1935 


Gehirn 


0 0000 


0.0062 


0.0000 undef 


Haematopoetisch 


0 . 1203 


0.0000 


undef 0 . 0000 


Haut 


a 0441 


0 0000 

SJ ■ WW 


undef 0.0000 


Hepatisch 


o oooo 


0.1229 


0. 0000 undef 


Herz 


0. 0021 


0.0000 


undef 0.0000 


Hoden 


0.0000 


0.0935 


0.0000 undef 


Lunqe 


0.0353 


0.0900 


0.3926 2.5473 


Magen-Speiseroehre 


0.0000 


0.0077 


0.0000 undef 


"* Muskel-Skelett 


0.0857 


0.0120 


7.1388 0.1401 


Niere 


0.0081 


0.0753 


0.1081 9.2471 


Pankreas 


0.0529 


0.1381 


0.3829 2.6116 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0109 


0.0064 


1.7060 0.5862 


Uterus Endometrium 


0.0000 


0.0000 


undef undef 


Uterus_Myometrium 


o.oooo 


0.0000 


undef undef 


Uterus allgemein 


0.0000 


0.0000 


undef undef 


Brust-Hyperplasie 


0.0160 






Pros tata— Hype rplas ie 


0.0000 






Samenblase 


0.0000 






Sinnesorgane 


0.0118 






Weisse Blutkoerperchen 


0.0009 






— Zervix 


O 0106 






'• 


FOETUS 








SlHaeufiokeit- 






Entwicklung 


0 0000 






Gastrointenstinal 


0 0000 






Gehirn 


0 0000 






Haematopoetisch 


0.0039 






Haut 


0.0000 






Hepatisch 


0.0000 






Herz-Blutgefaesse 


0.0000 






Lunge 


0.0000 






Nebenniere 


0.0000 






Niere 


0.0000 






Placenta 


0.0000 






Prostata 


0.0000 






Sinnesorgane 


0.0000.. 







0.0431 



10 



15 



20 



25 



30 



35 



40 



45 



Brust 
Eierstock_n 
EierstockM; 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0082 

0.0000 

0.0000 

0.0000 

0.0000 



50 



55 



60 



65 



39 
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Elcktronischer Northern fur SEQ. ID. NO: 36 



10 



15 



20 



25 



30 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Blase 


0.0078 


0.0332 


0.2347 4.2611 


Brust 


0,0153 


0.0169 


0.9074 1.1021 


Duenndarm 


0.0092 


0.0165 


0.5561 1.7982 


Eier stock 


0.0210 


0.0130 


1.6118 0.6204 


Endokrines^Gewebe 


0.0085 


0.0150 


0.5660 1.7667 


Gastrointestinal 


0 0153 


0 0231 


0.6627 1.5090 


Gehirn 


n m rs 


•* 0.0226 


0.8181 1.2223 


Haematopoetisch 


o n??7 


0. 0379 


0.5999 1.6669 


Haut 


u • uu / .3 




undpf 0 0000 


Hepatisch 


ft ftftdR 




0 2451 4 0800 


Herz 


n ft9fn 


n oi ^7 


1.4649 0. 6827 


Hoden 


0.0115 


0.0234 


0.4920 2.0326 


Lungs 


0.0166 


0.0164 


1.0161 0.9842 


Magen- Spe is eroehre 


0.0000 


0.0153 


0.0000 undef 


Muskel-Skelett 


0.0137 


0.0060 


2.2844 0.4378 


Niere 


0.0081 


0.0274 


0.2974 3.3626 


Pankreas 


0.0050 


0.0055 


0.8974 1.1143 


Penis 


0.0180 


0.0267 


0.6739 1.4839 


Prostata 


0.0065 


0.0106 


0.6142 1.6282 


Ut erusJEndomet r ium 


0.0135 


0.0528 


0.25613.9053 


0terus_Myometrium 


0.0152 


0.0408 


0.37412.6732 


Uterus_allgemein 


0.0458 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.0256 







Prostata-Hyperplasie 0.0208 

Samenblase 0.0178 

Sinnesorgane 0.0118 

Weisse_Blutkoerperchen 0 . 0061 

Zervix 0.0106 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0417 

0.0167 

0.0250 

0.0079 

0.2513 

0.0000 

0.0142 

0.0181 

0.0000 

0.0000 

0.0242 

0.0249 

0.0000 



50 



55 



60 



65 



Brust 
Eierstock_n 
Eierstock>_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 

Haut-Muskel 

Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE / SUBTRAH IERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0136 

0.0000 

0.0101 

0.0000 

0.0082 

0.0000 

0.0000 

0.0292 

0.0077 

0.0164 

0.0110 

0.0068 

0.0000 

0.0208 



40 
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Hektronischer Northern fur SEQ. ID. NO: 37 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeiifigkeit N/T 


T/N 


Blase 


0.0078 


0.0332 


0.2347 4.2611 


Brust 


0 0026 


0.0000 

V * www 


. undef 


0.0000 


Duenndarm 


0 0000 


0.0000 

W * w w w w 


undef 


undef 


Eierstock 


0 0000 

W • W WWW 


0.0000 


undef 


undef 


Endokrines Gewebe 


0.0000 


0.0000 


undef 


undef 


Gastrointestinal 


0.0115 


0.0000 


undef 


0.0000 


Gehirn 


0 0000 


0.0000 


undef 


undef 


Haema t opoe t is ch 


0 0147 

W ♦ \JJL t ± i 


0.0000 


undef 


0.0000 


Haut 


n oooo 


0 0000 

w * www 


undef 


undef 


Hepatisch 


0 0000 


0.0000 


undef 


undef 


Herz 


0 0000 

W * W WW 


0.0000 


undef 


undef 


Hoden 


0.0000 

W • W WWW 


0.0000 


undef 


undef 


Lunge 


0,0021 


0.0020 


1.0161 0.9842 


Magen-Speiseroehre 


0.0000 


0.0000 


undef 


undef 


" Muskel-Skelett 


0.0069 


0.0000 


undef 


0.0000 


Niere 


0.0000 


0.0000 


undef 


undef 


Pankreas 


0.0017 


0.0607 


0.0272 36.7712 


Penis 


0.0000 


0.0000 


undef 


undef 


Prostata 


0 0000 

w « WWW 


0.0000 

w* wwww 


undef 


undef 


Uterus_Endometrium 


0.0000 


0.0000 


undef 


undef 


Uterus Myometrium 


0.0000 


0.0000 


undef 


undef 


Uterus_allgemein 


0.0000 


0.0000 


undef 


undef 


Brust-Hyperplasie 


0.0000 








Prostata-Hyperplasie 


0.0000 








Samenblase 


0.0000 








Sinnesorgane 


0.0000 








Weisse_Blutkoerperchen 


0.0009 








Zervix 


0.0000 









5 



10 



15 



20 



25 



30 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 35 

0.0000 

0.0000 

0.0000 

0.0039 

0.0000 40 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 45 

0.0000 

0.0000 

0.0000, 



Brust 
Eierstockjn 
EierstockM: 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/ SUBTRAHIERTE BIBLIOTEEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 



50 



55 



60 



65 
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Elektronischer Northern fur SEQ. ID. NO: 38 



5 



to 



15 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T 


T/H 


Blase 


0. 


0624 


0.2607 


0. 


2393 


4.1791 


Brust 


0. 


0051 


0.0282 


0. 


1815 


5.5104 


Duenndann 


0. 


0337 


0.0000 


undef 


0.0000 


Eierstock 


0. 


0030 


0.0937 


0. 


0320 


31.2702 


Endokrines_Gewebe 


0. 


0034 


0.0025 


1. 


3585 


0.7361 


Gastrointestinal 


0. 


1360 


0.0694 


1. 


9604 


0.5101 


Gehirn 


0. 


0015 


0.0318 


0. 


0464 


21.5290 


Haematopoetisch 


0. 


1337 


0.0000 


undef 


0.0000 


Haut 


0. 


0147 


0.0000 


undef 


0.0000 


Hepatisch 


0. 


0000 


0.4594 


0. 


0000 


undef 


Herz 


0. 


0127 


0.0137 


0. 


9252 


1.0809 


Hoden 


0. 


0000 


0.1754 


0. 


0000 


undef 


Lunge 


0. 


0540 


0.2086 


0. 


2590 


3.8610 


Magen-Speiseroehre 


0. 


0193 


0.0767 


•0. 


2521 


3.9666 


Muskel-Skelett 


0. 


1747 


0.0240 


7. 


2815 


0.1373 


Kiere 


0. 


0190 


0.0205 


0. 


9252 


1.0808 


Pankreas 


0. 


0611 


0.0773 


0. 


7906 


1.2649 


Penis 


0. 


0030 


0.0000 


undef 


0.0000 


Prostata 


0. 


0196 


0.0021 


9. 


2126 


0.1085 


Dterus_Endametrium 


0. 


0135 


0.0000 


undef 


0.0000 


Uterus_Myametrium 


0. 


0000 


0.0000 


undef 


undef 


Oterus_allgemein 


0. 


0000 


0.0000 


undef 


undef 


Brus t-Hyperplas ie 


0- 


0064 











Prostata-Hyperplasie 0.0089 
Samenblase 0.0089 
Sinnesorgane 0 . 0353 
30 Weisse_Blutkoerperchen 0.0009 
Zervix 0-0426 



FOETUS 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0061 

0.0000 

0.0000* 

NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 

%Baeufigkeit 

0.0000 

0.0000 

0.0101 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0010 

0.0000 

0.0000 

0.0000 



35 

Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 

40 Haut 
Hepatisch 

Herz-Blutgefaesse 

Lunge 

Nebenniere 

Niere 

45 Placenta 
Prostata 
Sinnesorgane 



50 

Brust 
Eierstock w n 
Eierstock_t 
55 EndokrinesjGewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
s " Haut-Muskel 
60 Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
65 Uterus n 
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Elektronischer Northern fur SEQ. ID. NO: 39 




NORMAL 


TOMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Blase 


0.0702 


0.2761 


0.2542 3.9333 


Brust 


0.0614 


0.0620 


0.9899 1.0102 


Duenndarm 


0.2024 


0.1489 


1.3594 0.7356 


Eierstock 


0.0060 


0.0442 


0.1354 7.3832 


Endokrines Gewebe 


0.0255 


0.0025 


10.1887 0, 


Ga s t r o int e s t inal 


0.4138 


. 0.0740 


5.5913 0.1788 


Gehirn 


0.0044 


' 0.0431 


0.1029 9.7228 


Haematopoetisch 


0.5774 


0.0000 


undef 0.0000 


Haut 


0.0257 


0.0000 


undef 0.0000 


Hepatisch 


0.0285 


0.3364 


0.0848 11.7866 


Herz 


0.0095 


0.0412 


0.2313 4.3235 


Hoden 


0.0000 


0.2689 


0.0000 undef 


Lunge 


0.1735 


0.2678 


0.64761.5441 


Magen-Speiseroehre 


0.0870 


0.1917 


0.4538 2.2037 


Muskel-Skelett 


0.2193 


0.0180 


12.1835 0 


Niere 


0.0136 


0.1164 


0.1166 8.5746 


Pankreas 


0.0677 


0.1822 


0.3717 2.6906 


Penis 


0.0030 


0.1333 


0.0225 44.5175 


Prostata 


0.0327 


0.0085 


3.83860.2605 


Uterus_Endometrium 


0.0000 


0.0000 


undef undef 


Uterus_Myometrium 


0.0000 


0.0000 


undef undef 


Uterus^ allgemein 


0.0051 


0.0000 


undef 0.0000 



Brust-Hyperplasie 0.0927 
Prostata-Hyperplasie 0. 0178 
Samenblase 0 . 02 67 
Sinnesorgane 0.2823 
Weisse_Blutkoerperchen 0.0087 
Zervix 0.0426 



0981 



0.0821 



10 



15 



20 



25 



30 



FOETUS 

%Haeufigkeit 35 

Entwicklung 0.0000 

Gastrointenstinal 0.0056 

Gehirn 0.0000 

Haematopoetisch 0.0157 

Haut 0.0000 40 

Hepatisch 0.0000 

Herz-Blutgefaesse 0.0000 

Lunge 0.0000 

Nebenniere 0.0254 

Niere 0.0000 45 

Placenta 0.0000 

Prostata 0.0000 

Sinnesorgane 0.OO0Q; 



Brust 
Eierstock^n 
Eierstock^t 
EndokrinesjGewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/SOBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.4082 

0.0000 

0.3493 

0.0000 

0.0082 

0.1953 

0.0000 

0.0000 

0.0000 

0.0491 

0.0030 

0.0000 

0.0000 

0.0000 



50 



55 



60 



65 



43 



• DE 198 18 619 A 1 # 



Elektronischer Northern fur SEQ. ID. NO: 40 







NORMAL 


TUMOR 


Verhaeltnlsse 






%Haeufigkeit 


%Haeufigkeit N/T T/N 


5 


Blase 


0.0195 


0.0690 


0.2825 3.5400 




Brust 


0.0166 


0.0320 


0.5204 1.9216 




Duenndarm 


0.0031 


0.0000 


undef 0.0000 




Eierstock 


0.0150 


0.0130 


1.1513 0.8686 




Endokrines_Gewebe 


0.0085 


0.0100 


0.8491 1.1778 




Gastrointestinal 


0.0019 


0.0093 


0.2071 4.8289 


10 


Gehirn 


0.0067 


0.0442 


0.1507 6.6362 




Haematopoetisch 


0.0187 


0.0000 


undef 0.0000 




Haut 


0.0073 


0.0000 


undef 0.0000 




Hepatisch 


0.0238 


0.0194 


1.2255 0.8160 




Herz 


0.0625 


0.1512 


0.4135 2.4182 


15 


Hoden 


0.0345 


0.0117 


2.9518 0.3388 




Lunge 


0.0322 


0.0286 


1.1249 0.8889 




Magen-Speiseroehre 


0.0000 


0.0307 


.0.0000 undef 




Muskel-Skelett 


0.0668 


0.1260 


0.5303 1.8857 




Niere 


0.0190 


0.0342 


0.5551 1.8014 


20 


Pankreas 


0.0050 


0.1160 


0.0427 23.3998 




Penis 


0.0299 


0.0000 


undef 0.0000 




Prostata 


0.0131 


0.0170 


0.7677 1.3026 




UterusJEndometrium 


0.0068 


0.0528 


0.1280 7.8106 




Uterus^Myometrium 


0.0305 


0.0204 


1.4964 0.6683 


25 


Uterus_allgemein 


0.0357 


0.0000 


undef 0.0000 




Brust-Hyperplasie 


0.0000 








Prostata-Hyperplasie 


0.0208 








Samenblase 


0.0356 








Sinnesorgane 


0.0588 






30 


Weisse_Blutkoerperchen 


0.0000 








Zervix 


0.0319 










FOETUS 






35 




%Haeuf igkeit 








Entwicklung 


0.0000 








Gastrointenstinal 


0.0167 








Gehirn 


0.0000 








Haematopoetisch 


0.0118 






An 


Haut 


0.0000 








Hepatisch 


0.0520 








Herz-Blutgefaesse 


0.0107 








Lunge 


0.0325 








Nebenniere 


0.0000 








Niere 


0.0494 






45 


Placenta 


0.0909 








Prostata 


0.0000 








Sinnesorgane 


0.0000:' 







50 



55 



60 



65 



Brust 
Eierstoclfja 
Eierstock_t 
Endokrines_Gewehe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0340 

0.0000 

0.0203 

0.0490 

0.0297 

0.0000 

0.0000 

0.0000 

0.0154 

0.0082 

0.0030 

0.0000 

0.0000 

0.0000 
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Hektronischer Northern fur SEQ. ID. NO: 41 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesjGewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Oterus_Endametrium. 
Uterus_Myametrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Saraenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 
%Haeufigkeit 
0.0390 
0.0269 
0.1932 
0.0000 
0.0017 
0.2203 
0.0000 
0.0882 
0.0184 
0.0000 
0.0095 
0.0000 
0.0478 
0.0193 
0.1011 
0.0054 
0.0099 
0.0000 
0.0153 
0.0068 
0000 
0000 
0192 
0119 
0000 
0000 
0009 
0213 



TOMOR 

%Haeufigkeit 

0.1355 

0.0564 

0.0000 

0.0312 

0.0025 

0.0509 

0.0103 

0.0000 

0.0000 

0.0518 

0.0137 

0.0702 

0.1472 

0.0153 

0.0120 

0.0548 

0.0552 

0.2933 

0.0021 

0.0000 

0.0068 

0.0000 



Verhaeltnisse 
N/T T/N 
0.2878 3.4744 
0.4764 2.0992 
undef 0.0000 
0.0000 undef 
0.6792 1.4722 
4.3300 0.2309 
0.0000 undef 
undef 0.0000 
undef 0.0000 
0.0000 undef 
0.6939 1-4412 
0.0000 undef 
0.3246 3.0809 
1.2605 0.7933 
8.4237 0.1187 
0.0991 10.0878 
0.1795 5.5714 
0 . 0000 undef 
7.1654 0.1396 
undef 0.0000 
0.0000 undef 
undef undef 



FOETUS 

%Haeufigkeit 
Entwicklung 0.0000 
Gastrointenstinal 0.0000 
Gehirn 0.0000 
Haematopoetisch 0.0079 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0000 
Lunge 0.0000 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000: 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0000 
Eierstockji 0.0000 
Eierstock^t 0.0000 
Endokr ines_Gewebe 0.0000 
Foetal 0.0000 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0000 
Nerven 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 42 



10 



15 



20 



25 



30 



Blase 
Brust 
Duenndarm 
Eierstbck 
EndokrinesjGewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
" Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus__Myometrium 
Uterusjallgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Saxnenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 
%Haeu£igkeit 
0.0273 
0.0665 
0.0429 
0.0539 
0.0630 
0.0441 
0554 
0454 
0257 
0381 
0.0435 
0.0575 
0.1008 
0.0580 
0.0976 
0.0516 
0.0248 
0.0599 
0.0567 
0.0405 
0.0534 
0.0866 
0.0416 
0.0654 
0.0712 
0.0823 
0.1110 
0.0319 



TUMOR 

%Haeufigkeit 
0.0895 
0.0808 
0.0000 
0.0234 
0.0978 
0.0324 
0.1006 
0.0379 
0.0000 
0.0453 
0.0825 
,0000 
.0552 
.0997 
,0660 
0.0690 
0.0773 
0.1066 
0.0766 
0.1055 
0.0475 
0.0000 



Verhaeltnisse 
N/T T/N 
0.3051 3.2777 
0.8230 1.2151 
undef 0.0000 
2.3025 0.4343 
0.6444 1.5518 
1.3608 0.7348 
0.5510 1.8149 
1.1998 0.8335 
undef 0.0000 
0.8403 1.1900 
0.5268 1.8981 
undef 0.0000 
1.8252 0.5479 
0.5818 1.7188 
1.4797 0.6758 
0.5795 1.7255 
0.3205 3.1200 
0.5616 1.7807 
0.7393 1.3527 
0.3841 2.6035 
1.1223 0.8911 
undef 0.0000 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0139 

0.0361 

0.0125 

0.0433 

0.0000 

0.0000 

0.0071 

0.0253 

0.0254 

0.0432 

0.0364 

0.0499 

0.0126 



50 NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 
Brust 0.1020 
Eierstock n 0.1595 
EierstockM: 0.0709 
55 EndokrinesjSewebe 0.0000 

Foetal 0.0635 
Gastrointestinal 0.0000 
Haematopoetisch 0.0114 
Haut-Muskel 0.0680 
60 Hoden 0.0463 

Lunge 0.0328 
Nerven 0.0351 
Prostata 0.0342 
Sinnesorgane 0 . 0464 
65 Uterus n 0.0083 
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Hektronischer Northern fur SEQ. ID. NO: 43 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrine s_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Her z 
Hoden 
Lunge 

Magen-Speiseroehre 
* Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus_Myometrium 
Dterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 
%Haeufigkeit 
0.0195 
0,0345 
0.0399 
0.0150 
0.0238 
0.0862 
0.0067 
0.0080 
.0587 
.0190 
.0011 
.0000 
.0062 
0.0387 
0.0000 
0.0760 
0.0677 
0.0090 
0.0109 
0.0270 
0.0076 
0.0000 
0576 * 
0119 
1068 
0235 
0061 
0319 



TUMOR 

%Haeufigkeit 

0.0639 

0.0470 

0.1985 

0.0676 

0.0878 

0.1064 

0.0277 

0.0000 

0.0000 

0.0323 

0.0962 

0.0234 

0.0164 

0.3450 

0.0360 

0.1643 

0.0276 

0.0533 

0.0255 

0.0000 

0.0272 

0.4771 



Verhaeltnisse 
N/T T/N 
0.3051 3.2777 
0.7350 1.3606 
0.2008 4.9797 
0.2214 4.5168 
0.2717 3.6805 
0.8103 1.2341 
0.2400 4.1669 
undef 0.0000 
undef 0 . 0000 
0.5882 1.7000 
0.0110 90.7941 
0.0000 undef 
0.3810 2.6245 
0.1120 8.9248 
0.0000 undef 
0.4626 2.1617 
2.4530 0.4077 
0.1685 5.9357 
0.4265 2.3446 
undef 0.0000 
0.2806 3.5642 
0.0000 undef 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETOS 

%Haeufigkeit. 

0.0000 

0.0111 - 

0.0813 

0.0000 

0.0000 

0.0000 

0.0000 

0.0145 

0.0000 

0.0309 

0.0121 

0.0249 

0.0000 



NORMIERTE/SDBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0136 

Eierstock_n 0.0000 

Eierstock^t 0.0101 

EndokrinesjGewebe 0.0000 

Foetal. 0.0122 

Gastrointestinal 0.4149 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0154 

Lunge 0.0573 

Nerven 0.0040 

Prostata 0.0068 

Sinnesorgane 0.0000 

Uterus n 0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 44 



10 



15 



20 



25 



30 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesjGewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
■ Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus_Myometrium 
Uterus — allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 
%Haeufigkeit 
0.0156 
.0166 
.0828 
.0030 
.0017 
.1226 
0.0000 
0.0481 
0.0294 
0.0000 
0.0032 
0.0000 
0.0447 
0.0097 
0.0497 
0.0054 
0.0215 
0.0030 
0.0044 
0.0000 
0.0000 
0.0000 
0.0352 
0.0030 
0.0000 
0.0000 
0.0000 
0.0106 



TUMOR 

%Haeufigkeit 

0.0511 

0.0320 

0.0000 

0.0416 

0.0050 

0.0278 

0.0123 

0.0000 

0.0000 

0.1294 

0.0137 

0.1403 

0.1390 

0.0000 

0.0060 

0.0685 

0.0166 

0.0000 

0.0043 

0.0000 

0.0000 

0.0000 



Verhaeltnisse 
N/T T/N 
0.3051 3.2777 
0.5204 1.9216 
undef 0.0000 
0.0720 13.8979 
0.3396 2.9444 
4.4178 0.2264 
0 . 0000 undef 
undef 0.0000 
undef 0.0000 
0.0000 undef 
0.2313 4.3235 
0.0000 undef 
0.3213 3.1128 
undef 0.0000 
6.2810 0.1208 
0.0793 12.6097 
1.2963 0.7714 
undef 0.0000 
1.0236 0.9769 
undef undef 
undef undef 
undef undef 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit. 

0.0000 

0.0000 

0.0000 

0.0079 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0. 0000 

0.0000 



50 



55 



60 



65 



Brust 
Eierstock_n 
EierstockM; 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0136 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 
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Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Ho.den 
Lunge 

Magen-Speis eroehre 
• Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasia 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



Elektronischer Northern for SEQ. ID. NO: 45 


NORMAL 


TUMOR 


Verhaeltnisse 


%Haeufigkeit 


%Haeufigkeit N/T T/N 


0,0117 


0.0383 


0.3051 3.2777 


0.0179 


0.0207 


0.8661 1.1546 


0.0245 


0 0165 


1.4830 0.6743 


0.0329 


0. 0156 


2.1106 0.4738 


0.0221 


0. 0326 


0.6792 1.4722 


0.0153 


ft m *}Q 


1 1045 0 9054 


0.0296 




1.0285 0. 9723 


0.0187 


a 0.V7Q 


0 4940 2.0241 


0.0257 




undef 0.0000 


0.0048 




0 3676 2.7200 


0.0201 


ft 0550 


0.3662 2.7306 


0.0058 


0 0117 


0.4920 2.0326 


0.0218 


0.0225 


0.9699 1.0311 


0.0387 


0.0383 


.1.0084 0.9916 


0.0171 


0.0240 


0.7139 1.4008 


0.0190 


0.0068 


2.7756 0.3603 


0.0116 


0.0221 


0.5235 1.9102 


0.0090 


0.0000 


undef 0.0000 


0.0327 


0.0319 


1.0236 0.9769 


0.0068 


0.0000 


undef 0.0000 


0.0000 


0.0136 


0.0000 undef 


0.0560 


0.0000 


undef 0.0000 



0.0032 
0.0208 
0-0178 
0.0118 
0.0260 
0.0000 



FOETUS 

%Haeuf igkeit . 
Entwicklung 0.0000 
Gastrointenstinal 0.0139 
Gehirn 0.0313 
Haematopoetisch 0 . 0039 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgef aesse 0 . 0071 
Lunge 0.0217 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0424 
Prostata 0.0249 
Sinnesorgane 0.0000 

NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0408 
Eierstockja 0.0000 
Eierstock^t 0.0405 
Endokrines_Gewebe 0.0245 
Foetal 0.0087 
Gastrointestinal 0.0000 
Haematopoetisch 0.0057 
Haut-Muskel 0.0097 
Hoden 0.0231 
Lunge 0.0082 
Nerven 0.0251 
Prostata 0.0205 
Sinnesorgane 0.0000 
Uterus n 0.0125 
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Elektronischer Northern fur SEQ. ID. NO: 46 



10 



15 



20 



25 



30 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeu£igkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0. 


1170 


0.3067 


0.3814 


2. 


6222 


Brust 


0. 


3019 


0.23B7 


1.2646 


0. 


7908 


Duenndann 


1. 


1559 


0.0000 


undef 


0. 


0000 


Eierstock 


0. 


0120 


0.0676 


0.1771 


5. 


6460 


Endokrines_Gewebe 


0. 


0034 


0.0125 


0.2717 


3. 


6805 


Gastrointestinal 


1. 


2798 


0.1804 


7.0940 


0. 


1410 


Gehirn 


0. 


0007 


0.0380 


0.0195 


51 


.3918 


Haematopoetisch 


0. 


4785 


0.0000 


undef 


0. 


0000 


Haut 


0. 


1322 


0.0000 


undef 


0. 


0000 


Hepatisch 


0. 


0285 


0.4594 


0.0621 


16 


1.0932 


Herz 


0. 


0138 


0.0275 


0.5011 


1. 


9955 


Hoden 


0. 


0000 


0.2456 


0.0000 


undef 


Lunge 


0. 


3625 


0.3435 


1.0554 


0. 


9475 


Magen-Speiseroehre 


0. 


0483 


0.1533 


0.3151 


3. 


1733 


• Muskel-Skelett 


0. 


2124 


0.0480 


'4.4260 


0. 


2259 


Niere 


0. 


0163 


0.1917 


0.0850 


11 


.7691 


Pankreas 


0. 


1074 


0.4528 


0.2371 


4. 


2171 


Penis 


0. 


0000 


0.0533 


0.0000 


undef 


Prostata 


0. 


0806 


0.0255 


3.1562 


0. 


3168 


Uterus JBndometrium 


0. 


0068 


0.0000 


undef 


0. 


0000 


Uterus_Myometrium 


0. 


0152 


0.0000 


undef 


0. 


0000 


Uterus_allgemein 


0. 


0102 


0.0000 


undef 


0. 


0000 


Brust-Hyperplasie 


0. 


3805 











Prostata-Hyperplasie 0 . 0505 

Samenblase 0.0356 

Sinnesorgane 0 . 5175 

Weisse_Blutkoerperchen 0.0078 

Zervix 0.0213 



FOETUS 

35 %Haeufigkeit. 

Entwicklung 0.0000 

Gastrointenstinal 0.0000 

Gehirn 0.0000 

Haematopoetisch 0.0039 

40 Haut 0.0000 

Hepatisch 0.0000 

Herz-Blutgefaesse 0.0000 

Lunge 0.0000 

Nebenniere 0.0000 

45 Niere 0.0000 

Placenta 0.0303 

Prostata 0.0000 

Sinnesorgane 0.0000 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.7687 
Eierstockji 0.0000 
Eierstocfrt 0.1924 
EndokrinesJSewebe 0.0000 
Foetal 0.0029 
Gastrointestinal 0.0976 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0328 
Nerven 0.0020 
Prostata 0.0000 
Sinnesorgane 0 . 0000 
Uterus n 0.0000 



50 



DE 198 18 619 A 1 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesjGewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
" Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
UtenisJEndometrium 
Uterus_Myometrium 
Uterus_allgemein 
Brust-Byperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



Elektronischer Northern fur SEQ. ID. NO: 47 


NORMAL 


TUMOR 


Verhaeltnisse 


%Haeufigkeit %Haeufigkeit N/T T/N 


0.0195 


U . U4oD 


n 4014 2. 4911 


0. 0550 


. U. / 


n 4241 2 3580 


0. 0123 


O 1 CLKA 
U • 


n 0741 13. 4866 


0. 0449 


n motJ 
U • U / £o 


n fil67 1 6214 


0.0119 


0. UU /O 


1 ^R4Q 0 6309 


0.0153 


0. lOlo 




0.0037 


0.0359 




0.0134 


0.0000 


under u • uuuu 


0.0404 


0.0847 


0.4/DO Z.UyoO 


0.0285 


0.0323 


0. 8824 1. Xooo 


0.0233 


0.0825 


0.2827 3.5374 


0.0460 


0.0351 


1.3119 0-7522 


0.0551 


0.04 31 


1 1 21 Q 0 8913 


0.0000 


0.0613 


0.0000 undef 


0.1148 


0.3120 


0.3679 2.7180 


0.0271 


0.0137 


1.9826 0.5044 


0.0116 


0.0884 


0.1309 7.6408 


0.0180 


0.0533 


0.3369 2.9678 


0.0087 


0.0319 


0.2730 3.6634 


0.0270 


0.0000 


undef 0.0000 


0.0610 


0.1902 


0.3206 3.1187 


0.0509 


0.0000 


undef 0.0000 



0.0288 
0.0089 
0.0178 
0.0118 
0.0000 
0.0532 



FOETUS 

%Haeufigkeit 
Entwicklung 0.1809 
Gastrointenstinal 0.1388 
Gehirn 0.0063 
Haematopoetisch 0.0551 
Haut 0.2513 
Hepatisch 0.0000 
Herz-Blutgef aesse 0 . 0605 
Lunge 0.1409 
Nebenniere 0.2535 
Niere 0.1235 
Placenta 0.0364 
Prostata 0.0748 
Sinnesorgane 1.3934.= 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0204 
Eierstock_n 0.0000 
Eierstock^t 0.0304 
Endokrines_Gewebe 0.0000 
Foetal 0.0315 
Gastrointestinal 0.0122 
Haematopoetisch 0.0000 
Haut-Muskel 0.0518 
Hoden 0.0231 
Lunge 0.0000 
Nerven 0.0040 
Prostata 0.0137 
Sinnesorgane 0.0000 
Uterus n 0.0083 
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Elektronischer Northern fur SEQ. ID. NO: 48 



5 



10 



15 



20 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T 


T/N 


Blase 


0.0624 


0.1380 


0.4520 


2. 


2125 


Brust 


0.0345 


0.0564 


0.6125 


1. 


6327 


Duenndarm 


0.0491 


0.0165 


2.9659 


0. 


3372 


Eierstock 


0.0689 


0.0520 


1.3239 


0. 


7553 


EndokrinesjGewebe 


0.0392 


0.0276 


1.4202 


0. 


7041 


Gastrointestinal 


0.0460 


0.0648 


0.7100 


1. 


4084 


Gehirn 


0.0333 


0.0678 


0.4909 


2. 


0372 


Haematopoetisch 


0.0374 


0.0758 


0.4940 


2. 


0241 


Haut 


0.0257 


0.1695 


0.1516 


6. 


5954 


Hepatisch 


0.1142 


0.0518 


2.2059 


0. 


4533 


Herz 


0.0774 


0.7010 


0.1104 


9. 


0616 


Hoden 


0.2589 


0.1520 


1.7030 


0. 


5872 


Lunge 


0.0540 


0.0491 


1.1007 


0. 


9085 


Magen-Speiseroehre 


0.1256 


0.2070 


0.6069 


1. 


6477 


Muskel-Skelett 


0.1542 


0.2100 


0.7343 


1. 


3619 


Niere 


0.0109 


0.0959 


0.1133 


8. 


8268 


PanJcreas 


0.0198 


0.1270 


0.1561 


6. 


4071 


Penis 


0.0359 


0.0800 


0.4493 


2. 


2259 


Prostata 


0.0785 


0.0511 


1.5354 


0. 


6513 


Uterus_Endometrium 


0.0338 


0.0528 


0.6402 


1. 


5621 


Uterus_Myometrium 


0.0457 


0.0679 


0.6734 


1. 


4851 


Uterus_allgemein 


0.0764 


0.0954 


0.8006 


1. 


2490 



Prostata-Hyperplasie 0.0624 
Samenblase 0.0445 
Sinnesorgane 0.0118 
30 WeisseJBlutkoerperchen 0.0529 
Zervix 0.0532 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit. 

0.0139 

0.0167 

0.0000 

0.0039 

0.0000 

0.0000 

0.0249 

0.0108 

0.0000 

0.0000 

0.3333 

0.1995 

0.0000. 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.1156 
Eierstock_n 0.0000 
Eierstoc)e_t 0.1873 
Endokrines_Gewebe 0.0000 
Foetal 0.0181 
Gastrointestinal 0.0000 
Haematopoetisch 0.0057 
Haut-Muskel 0.0486 
Hoden 0.0000 
Lunge 0.0328 
Nerven 0.0020 
Prostata 0.0274 
Sinnesorgane 0.0155 
Uterus n 0.0541 
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Elektronischer Northern fur SEQ. ID. NO: 49 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0.0585 


0.1278 


0.4576 


2. 


1852 


Brust 


0.0742 


0.0507 


. 1.4619 


0. 


6841 


Duenndana 


0.1134 


0.0000 


undef 


0. 


0000 


Eierstcck 


0.0659 


0.0390 


1.6885 


0. 


5922 


Endokrines Gewebe 


0.0324 


0.0176 


1.8437 


0. 


5424 


Gastrointestinal 


0.0690 


0.0694 


0.9940 


1. 


0060 


Gehirn 


0.0325 


0.0893 


0.3641 


2. 


7464 


Haematopoetisch 


0.0842 


0.0000 


undef 


0. 


0000 


Haut 


0.0404 


0.0000 


undef 


0. 


0000 


Hepatisch 


0.1855 


0.0647 


2.8677 


0. 


3487 


Herz 


0.0435 


0.0962 


0.4516 


2. 


2145 


Hoden 


0.0230 


0.0585 


0.3936 


2. 


5408 


Lunge 


0.2275 


0.1063 


2.1396 


0. 


4674 


Magen-Speiseroehre 


0.0483 


0.1993 


0.2424 


4. 


1252 


". Muskel-Skelett 


0.0857 


0.0720 


1.1898 


0. 


8405 


Niere 


0.0706 


0.0274 


2.5774 


0. 


3880 


Pankreas 


0.0347 


0.2927 


0.1185 


8. 


4367 


Penis 


0.0090 


0.0267 


0.3369 


2. 


9678 


Prostata 


0.0458 


0.0319 


1.4331 


0. 


6978 


Uterus_EndQmetrium 


0.0270 


0 0000 


undef 


0. 


0000 


Uterus_Myametrium 


0.0610 


0.0204 


2.9927 


0. 


3341 


Uterus allgemein 


0.0917 


0 0000 


undef 


0. 


0000 


Brust-Hyperplasie 


0.0416 










Pro s t a t a— Hype rplasie 


0.0386 










Samenblase 


0.0267 










Sinnesorgane 


0.1647 










Weisse Blutkoerperchen 


0.2497 










~~ Zervix 


0.0000 












FOETUS 












%Haeufigkeit 










Entwicklung 


0.0000 










Ga s t ro int ens t inal 


0.0916 










Gehirn 


0.0500 










Haematopoetisch 


0.0865 










Haut 


0.0000 










Hepatisch 


0.0520 










Herz-Blutgefaesse 


0.0178 










Lunge 


0.0434 










Nebenniere 


0.0000 










Niere 


0.0432 










Placenta 


0.1636 










Prostata 


0.0000 










Sinnesorgane 


0.0000. 











NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.1156 
Eierstock n 0.0000 
Eierstoc)f]t 0.0354 
Endokrines_Gewebe 0.0490 
Foetal 0.0571 
Gastrointestinal 0.0854 
Haematopoetisch 0.0057 
Haut-Muskel 0.0097 
Hoden 0.0154 
Lunge 0.0491 
Nerven 0.0562 
Prostata 0.0615 
Sinnesorgane 0 . 0000 
Uterus_n 0.1957 
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Elektronischer Northern fur SEQ. ID. NO: 50 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Dterus_Myometrium 
Dterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
30 Weisse_Blutkoerperchen 
Zervix 



10 



15 



20 



25 



NORMAL 


TUMOR 


Verhaeltnisse 


%Haeufigkeit %Haeufigkeit N/T T/N 


0.0546 


0.1099 


0.4967 2.0135 


0 0563 


0.0489 


1.1517 0.8683 


0.1380 


0.0331 


4.1708 0.2398 


0.0599 


0.0650 


0.9210 1.0858 


0 0238 


0.0351 


0.6792 1.4722 


0.1322 


0.1804 


0-7328 1.3647 


0.0229 


0.0452 


0.5072 1.9714 


0.0241 


0.0000 


undef 0.0000 


0.1689 


0.0000 


undef 0.0000 


0.0285 


0.0518 


0.5515 1.8133 


0.1166 


0.1649 


0.7067 1.4150 


0.0115 


0.0117 


0.9839 1.0163 


0.1070 


0.1329 


0.8050 1.2422 


0.1450 


0.0613 


•2.3635 0.4231 


0.0685 


0.0240 


2.8555 0.3502 


0.0570 


0.0753 


0.7570 1.3210 


0.0165 


0.1491 


0.1108 9-0256 


0.0779 


0.0267 


2.9202 0.3424 


0.0610 


0.0255 


2.3885 0.4187 


0.0338 


0.0000 


undef 0.0000 


0.0991 


0.0340 


2.9179 0.3427 


0.0509 


0.1908 


0-2669 3.7471 



0.0064 
0.0386 
0.0801 
0.0588 
0.0616 
0-1810 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0139 

0.0194 

0.0000 

0-0275 

0.0000 

0.0000 

0.0142 

0.0145 

0.0000 

0.0247 

0. 0364 

0.0499 

0.0000* 



50 



55 



60 



65 



normierte/ subtrahierte bibliotheken 

%Haeufigkeit 
Brust 0.0068 
Eierstockja 0.0000 
Eierstock__t 0-0101 
Endokrines_Gewebe 0.0000 
Foetal 0.0064 
Gastrointestinal 0.0976 
Haematopoetisch 0.0057 
Haut-Muskel 0.0259 
Hoden 0-0309 
Lunge 0.1802 
Nerven 0.0050 
Prostata 0.0274 
Sinnesorgane 0-0000 
Uterus n 0.0125 
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2.2 Fisher-Test 

Um zu entscheiden, ob eine Partial-Sequenz S eines Gens in einer Bibliothek fur Normal-Gewebe signifikant haufiger 
oder seltener vorkommt als in einer Bibliothek fur entartetes Gewebe, wild Fishers Exakter Test, ein statistisches Stan- 
dardverfahren (Hays, W. L., (1991) Statistics, Harcourt Brace College Publishers, Fort Worth), durchgefuhrt 

Die Null-Hypothese lauteC die beiden Bibliotheken konnen beziiglich der Haufigkeit zu S homologer Sequenzen nicht 
unterschieden werden. Falls die Null-Hypothese mit hinreichend hoher Sicherheit abgelehnt werden kann, wird das zu S 
gehorende Gen als interessanter Kandidat fur ein Krebs-Gen akzeptiert, und es wird im nachsten Schritt versucht, eine 
Verlangerung seiner Sequenz zu erreichen. 



10 



Beispiel3 

Automatische Verlangerung der Partial-Sequenz 
Die automatische Verlangerung der Partial-Sequenz S vollzieht sich in drei Schritten: 15 

1. Ermittlung aller zu S homologen Sequenzen aus der Gesamtmenge der zur \ferfugung stehenden Sequenzen mit 
Hilfe von BLAST 

2. Assemblierung dieser Sequenzen mittels des Standardprogramms GAP4 (Bonfield, J. K., Smith, K. R, und Sta- 
den R. (1 995), Nucleic Acids Research 23 4992-4999) (Contig-Bildung). 20 

3. Berechnung einer Konsens-S equenz C aus den assemblierten Sequenzen. 

Die Konsens-Sequenz C wird im allgemeinen langer sein als die Ausgangssequenz s. Ihr elektronischer Northern-Blot 
wird demzufolge von dem fur S abweichen. Ein emeuter Fisher-Test entscheidet, ob die Altemativ-Hypothese der Ab- 
weichung von einer gleichmaBigen Expression in beiden Bibliotheken aufrechterhalten werden kann. 1st dies der Fall, 
wird versucht, C in gleicher Weise wie S zu verlangern. Diese Iteration wird mit der jeweils erhaltenen Konsensus-Se- 
quenzen Q (i: Index der Iteration) fortgesetzt, bis die Altemativ-Hypothese verworfen wird (if Ho Exit; Abbruchkrite- 
rium I) oder bis keine automatische Verlangerung mehr moglich ist (while Q > Cn*, Abbruchkriterium II). 

Im Fall des Abbruchkriteriums H bekommt man mit der nach der letzten Iteration vorliegenden Konsens-Sequenz eine 
komplette oder annahemd komplette Sequenz eines Gens, das mit hoher statistischer Sicherheit mit Krebs in Zusammen- 30 
hang gebracht werden kann. 

Analog der oben beschriebenen Beispiele konnten die in der Tabelle I beschriebenen Nukleinsaure-Sequenzen aus 
Blasentumorgewebe gefunden werden. 

Ferner konnten zu den einzelnen Nukleinsaure-Sequenzen die Peptidsequenzen (ORFs) bestimmt werden, die in der 
Tabelle II aufgelistet sind, wobei wenigen Nukleinsaure-Sequenzen kein Peptid zugeordnet werden kann und einigen 35 
Nukleinsaure-Sequenzen mehr als ein Peptid zugeordnet werden kann. We bereits oben erwahnt, sind sowohl die ermit- 
telten Nukleinsaure-Sequenzen, als auch die den Nukleinsaure-Sequenzen zugeordneten Pteptid-Sequenzen Gegenstand 
der vorliegenden Erfindung. 

40 
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Chromosomale 
Lokalisatlon 




unbekannl 


unbekannt 


unbekannt 


unbekannt 




unbekannt 


unbekannt 


unbekannt 


unbekannt 


unbekannt 


unbekannt 


Chromosom 7 1 






1 unbekannti 




c 

CO 

J* 

CD 

c 




unbekannt 






unbekannt 






1 unbekannt) 


I unbekannti 








Lange der 
angemeldeten 
Sequenz in 
Basen 


CM 
O 
CM 


1 19261 


CM 
CO 


CO 

5> 


1146 


2407 


1471 


I 17321 


o> 

CO 

o> 


O 
IO 

T- 


I 14671 


m 

CD 

00 


N- 
CD 


to 


I 18991 


CO 

m 


s 

CO 


Si 

co 


I 21901 


I 25651 


CO 


| 2096 I 


I 13481 


00 
IO 
CO 


CD 
CO 


I 16321 


| 2972] 


CO 
CD 


N- 

a> 

CO 


K 


nge des 
Basen 


CM 


h» 

CO 
CM 


CO 
CD 

T— 


in 

CD 
CM 


CO 

o 

CO 


CO 

n- 






CO 
CO 

T— 


o> 


o 
in 


CO 
CO 
CM 




CO 

Si 


CO 
CD 


CO 
CD 
r- 


00 

a 


CO 
CM 
t— 


o 

£2 

CM 


o 

CO 




N- 
CM 
CM 


CO 
CM 


CO 


CD 

o 

CM 


o> 

00 


CD 


ID 
CO 

T— 


o 

CM 


CM 

T- 

CM 


CO 
CM 


CD 
D> 
CO 
































































Funktion 


Identisch zum humanen IgO aUs V-D-J6 Region | 


unbekannti 


unbekannti 


unbekannti 


unbekannti 


identisch zum Kaposi Sarcoma-assoziierten Herpesvirus 

Glycoprotein M 


Hyaluronectln (HN) 1st ein Giykoproteln, dass Hyaluron blndet 
und oft In humanen Tumoren gefunden wird 


unbekannti 


1 unbekannti 


unbekannt 


unbekannt 


I unbekannti 


mitochondrlaies Enzym I 


Identisch zum humanen Collagen 1 (alpha-1 Kette) 


identisch zum huamen Keratin K7 (Typ II) 


unbekannti 


identisch zum humanen Cofllln 


das H19 Gen wlrd nur vom maternalen Chromosom exprimiert 
und stellt mogllcherwelse ein Tumpr-Supressorgen dar 


| Identisch zum humanen IGF-2 Wach&tumsfaktor 


1 unbekannt 


I Identisch zum humanen IgV-L (Klon VL 29-1) 


I Identisch zum humanen H19 


I mitochondrlaies Enzym 


I identisch zum humanen Anti-Hepatitis A IgG Variable Region 


I identisch zum humanen Saposln 


| Gen, dass durch IL6 induzlert wird 


I unbekannt 


| identisch zu Immunoglobulin M schwere Kette V Region 


| Identisch zur humanen lg schweren Kette (variable Region) 


Identisch zur cDNA, die die leichte Kette eines monoklonalen 
Antikorpers kodiert, der gegen das humane Cytomegalovirus 

65 kD Protein gerichtet ist 


WahrscheinllchkeitfQr 
eine spezlfische 
Expression Im 
Tumorgewebe % 


S 
o> 

CD 


99.51 


99.2 


98.56 


97.43 


CO 


' 95.29 


95.29 


85.29 


91.23 


91.23 


CO 
CM 
T-r 
CD 


I 91.231 


91.23 


o 
o 


O 

o 

T— 


99.97 


o 
o 


O 

o 


I 99.561 


I 99.26 1 


o 
o 


I 98.771 


o 

O 


I 96.49 


1 94.05 


I 94.051 


o 
o 

T- 


o 
o 

T- 


§ 


Sequenz 
ID No.: 




CM 


CO 
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CO 




CO 


CD 
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T- 


CM 


CO 




in 


CO 


n. 


CO 


a> 


o 

CM 


CM 


CM 
CM 


CO 
CM 


Si 
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CM 


CD 
CM 


N» 
CM 


CO 
CM 


CD 
CM 
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Chromosomale 
Lokalisation 


unbekannt 


unbekannt 








unbekannt 












r 






unoeKannt 








■» 




Lange der 
angemeldeten 
Sequenz in 
Basen 


co 
o 


CO 


m 

CO 


CO 
CM 
CO 


m 


1203 


o 

CM 


CO 

S 


CO 

Nj 
O) 


s 

CM 




ID 
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CM 


CO 




i 


CO 

ro 


a> 

CD 
D> 


5 


3 


Lange des 
Ausgangs-EST in 
Basen 


o 

CM 


co 


O) 

co 


o> 


CO 
CM 


CO 

o 

CM 


h- 
O 
CM 


in 

CO 


h- 

CO 
CM 


CO 




CO 

o> 


co 
s. 


CM 
CM 


o 

CM 
CM 


OOZ 


CM 
CO 
CM 


O 

X> 


o 


O 

CM 


Funktion 


unbekannt | 


mitochondrlales Enzym | 


identisch zur Ig kappa leichten Kette variable Region D1 1.| 


identisch zum humanen hsp27| 


identisch zur Ig schweren Kettevariablen Region V(2-1) (v(h)-lv 

Famille). 


unbekannt) 


identisch zur Immunoglobulin lleichten Kette variable Region 
(Iambda-lllb Untergruppe) von IgM Rheuma-bezogener 

Faktoren. 


identisch zu US-Patent | 


identisch zu einer Immunoglobulin lambda leichten Kette 

(X57812) 


identisch zur humanen hPGi mRNA, die das "bone small 

proteoglycan P kodlert 


I identisch zu patentierter Sequenz 


I identisch zu patentierter Sequenz 


I Identisch zur CD24 "signal transducer mRNA" 


identisch zur humanen Ig schweren Kette variabien Region 

(Klon M49) 


I unbekannt 


Identisch zur Ig Alpha 1- Alpha 2m=lmmunoglobulin A1-A2 
lambda hybrid GAU schwere Kette {secreted alpha chain} 


I identisch zu Pro-alpha 2(l) collagen (COL1 A2) Gen 


1 Identisch zu humanen NIC (Natural Inhibitor of Collagenase] 


1 - identisch zur humanen Ferritin L Kette 


Identisch zum humanen Calcyclin Gen (auch Prolactin- 
Rezeptor assozllertes Protein) 


Wahrscheinlichkeit fur 
eine spezlfische 
Expression im 
Tumorgewebe % 


90.26 


90.26 


O 

o 


99.76! 


o 
o 


9.3.95 


' 93.95 


o 
o 


o 
o 


99.42 


99.99 


99.77 


99.11 


97.85 


94.61 


o 
o 

v- 


93.7 


99.54 


99.28 


97,9 


Sequenz 
ID No.: 
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TABKT.T.F.n 



DNA-Sequenzen 


Peptid-Sequenzen (ORPs) 


Seq. ID. No. 


Seq. ID. No. 


2 


128 






129 






130 
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131 






132 






133 
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134 






135 






136 
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137 






138 






139 
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140 
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142 
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143 
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10 
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13 
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157 
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16 


159 






160 






161 
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DNA-Sequenzen 


Peptid-Sequenzen (ORFs) 


Seq. ID. No. 


Seq. ID. No. 


18 


162 






163 






164 




1 


165 




20 


166 






167 






168 




23 


169 






170 




26 


171 






172 




27 


. 173 




31 


174 




32 


175 






176 






177 




36 


178 






179 






180 




45 


181 






182 






183 





Die erfinderischen Nukleinsaure-Sequenzen Seq. ID No. 1 bis Seq. ID No. 50 der ermittelten Kandidatengene und die 
ermittelten Aminosaure-Sequenzen Seq. ID No. 51-106 werden in dem nachfolgenden SequenzprotokoU beschrieben. 

SequenzprotokoU 

(1) ALLGEMEINE INFORMATION: 

(i) ANMELDER: 

(A) NAME: metaGen - Gesellschaft fur Genomforschung mbH 

(B) 'STRASSE: Ihnestrasse 63 

(C) STADT: Berlin 

(E) LAND: Deutschland 

(F) POST CODE (ZIP): D-14195 

(G) TELEFON: (030)-8413 1673 

(H) TELEFAX: (030)-8413 1674 
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(ii) TITEL DER ERFINDUNG: Menschliche Nukleinsaure-Sequenzen aus 

Blasentumorgewebe 

(Hi) Anzahl der Sequenzen: 106 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0. Version #1 .25 (EPO) 

(2) INFORMATION UBER SEQ ID NO: 1 : 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 202 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 

(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vK) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 1 

tgagagtcat ggacctcctg cacaagaaca tgaaacacct gtggttcttc ctcctcctgg 60 
tggcagctcc _cagatgggtc ctgtcccagg tgcagctaca gcagtggggc gcaggactgtl20 
tgaagccttc ggagaccctg tccctaacat gcgctgtctc cggtgactct tccagtacttl80 
actactggga ttggatccgc ca 202 

(2) INFORMATION UBER SEQ ID NO: 2: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1926 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 
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(iii) HYPOTHETISCH: NEIN 
(in) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 2 

ttgcgatggc tgatggactg tggctctcta accaaaggac cctagcgggc tcaacaattg 60 
tcaagagcag ttggtggttc tgaatacaat cctcagccaa ggatccctcc tgtgttacag 120 
atggatcagc taaaacaagc caacactgaa gacacaaaga atgaggttag gttcattgaa 180 
accagggtaa cacctgtgga tgagctaaac acaaagatga caatgacctt gtaccaggta 240 
tagaagctca gagacatgcc tgcaaaatga aatccctgag gaattttgca gctacccaga 300 
gatacgtggt tcaaattaaa atgtctgacg gatcactcat ttgaggaaca gcacatcagc 360 
ttcgcccttt acgtggacaa taggtttttt actttgacgg tgacaagtct ccacctggtg 420 
ttccagatgg gagtcatatt cccacaataa gcagccctta ctaagccgag agatgtcatt 480 
cctgcaggca ggacctatag gcacgtgaag atttgaatga aagtacagtt ccatttggaa 540 
gcccagacat aggatgggtc agtgggcatg gctctattcc tattctcaaa ccatgccagt 600 
ggcaacctgt gctcagtctg aagacaatgg acccacgtta ggtgtgacac gttcacataa 660 
ctgtgcagca catgccggga gtgatcagtc agacatttta atttgaacca cgtatctctg 720 
ggtagctaca aaattcctca gggatttcat tttgcaggca tgtctctgag cttctatacc 780 
tgctcaaggt cagtgtcatc tttgtgttta. gctcatccaa aggtgttacc ctggtttcaa 840 
tgaacctaac ctcattcttt gtgtcttcag tgttggcttg ttttagctga tccatctgta 900 
acacaggagg gatccttggc tgaggattgt atttcagaac caccaactgc tcttgacaat 960 
tgttaacccg ctaggctcct ttggttagag aagccacagt ccttcagcct ccaattggtg!020 
tcagtactta ggaagaccac agctagatgg acaaacagca ttgggaggcc ttagccctgcl080 
tcctctcaat tccatcctgt agagaacagg agtcaggagc cgctggcagg agacagcatgll40 
tcacccagga ctctgccggt gcagaatatg aacaatgcca tgttcttgca gaaaacgct.tl200 
agcctgagtt tcataggagg taatcaccag acaactgcag aatgtagaac actgagcaggl260 
acaactgacc tgtctccttc acatagtcca tatcaccaca aatcacacaa caaaaaggagl320 
aagagatatt ttgggttcaa aaaaagtaaa aagataatgt agctgcattt ctttagttatl380 
tttgaagccc caaatatttc ctcatctttt tgttgttgtc atgigatggtg gtgacatggal440 
cttgtttata gaggacaggt cagctgtctg gctcagtgat ctacattctg aagttgtctglSOO 
aaaatgtctt catgattaaa ttcagcctaa acgttttgcc gggaacactg cagagacaat!560 
gctgtgagtt tccaacctca gcccatctgc gggcagagaa ggtctagttt gtccatcaccl620 
attatgatat caggactggt tacttggtta aggaggggtc taggagatct gtcccttttal680 
gagacacctt acttataatg aagtacttgg gaaagtggtt ttcaagagta taaatatcctl740 
gtattctaat gatcatcctc taaacatttt atcatttatt aatcctccct gcctgtgtctl800 
attattatat tcatatctct acgctgcaaa ctttctgcct caatgtttac tgtgcctttgl8 60 
tttttgctag tgtgtgttgt tgaaaaaaaa aacattccct gcctaagtta gttttggcaal920 
agtatt 1926 



(2) INFORMATION UBER SEQ ID NO: 3: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) tANGE: 762 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 



(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editieaing 
hergestellte partielle cDNA 
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(Hi) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 3 

ctccactgca accacccaga gccatggctc cccgaggctg catcgtagct gtctttgcca 60 
ttttctgcat ctccaggctc ctctgctcac acggagcccc agtggccccc atgactcctt!20 
acctgatgct gtgccagcca cacaagagat gtggggacaa gttctacgac cccctgcagcl80 
actgttgcta tgatgatgcc gtcgtgccct tggccaggac ccagacgtgt ggaaactgca240 
ccttcagagt ctgctttgag cagtgctgcc cctggacctt catggtgaag ctgataaacc300 
agaactgcga ctcagcccgg acctcggatg acaggctttg tcgcagtgtc agctaatgga360 
acatcagggg aacgatgact- cctggattct ccttcctggg tgggcctgga gaaagaggct420 
ggtgttacct gagatctggg atgctgagtg gctgtttggg ggccagagaa acacacactc480 
aactgcccac ttcattctgt gacctgtctg aggcccaccc tgccgctgcc ctgaggaggc540 
ccacaggtcc ccttctagaa ttctggacag catgagatgc gtgtgctgat gggggcccag600 
ggactctgaa ccctcctgat gacccctatg gccaacatca acccggcacc accccaaggc660 
tggctgggga acccttcacc cttctgtgag attttccatc atctcaagtt ctcttctatc720 
caggagcaaa gcacaggatc ataataaatt- tatgtacttt aa 762 

(2) INFORMATION OBER SEQ ID NO: 4: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 918 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 4 

ctcgagccgc tcgagccgat tcggctcgag ccttcccgct ccctgcttgc aaagtggttg 60 
tgccccaagg tccgcctcca ggccacgtgg gtgctgcggg ccaagctttc ccttcctttgl20 
agagaggttt ccgctgtagg agcagagctt ccgggctgcg ctcttcgttg cccagtttccl80 
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gctcagtggt cgcgtctccg 
taggcgccat ggctccccgc 
ccgagaagcg agagaagctg 
tcgagcattg cactagctga 
tgcgcctgga ggccccagag 
cttcgaggtg acgctgctgc 
gaaggggccc ccacgcaaac 
gaagtacctg tcgtagggag 
tgatgttgga acattaatga 
gtttcttccc tgccagaaat 
aaaagttact ggatttggtt 
gaaaaaagaa aaagaaaaaa 
gccgcggggc gcggcggc 



ccccccaccc accagtcccc^ 
gggaggaagc gtaaggctga 
gcgaacggcg gggagggaat 
cgcgtctatg ggcgcaacgc 
cttccagtaa aggtgaaccc 
gcccggacgg cagcagtgcg 
tcaaattccc tgagcctcaa 
atttgggtag aagccctcat 
tggaacatgg ccaaacttca 
gaaggttcag .ttatgaggca 
taataaaagt tgaaataaag 
agaaagaaga aaaaagaaag 



ctgcattctc ggqpgggctc24C 
ggccc,cggtg gtcgccgtagSCO 
ggaggaggcg accgttgtta360 
cgcggccctg agccaggcgc420 
gacgaagccc cggagggcag480 
gaactctgga ctgggattaa540 
gaggtggtgg aagagttgaa600 
gctgagcttt gtgtccctgg660 
gtcatgatcc tgaagccatg720 
accctctagt aaggcattgt780 
taaaagaaaa aaaaaaaaga8 4 0 
gagaagcgag agaaagggag900 

918 



(2) INFORMATION OBER SEQ ID NO: 5: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1 146 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(ill) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 5 

tcagtttagt ggcagggtgg ttttttaatt ttcttctgtg gctggatttt tgttgtttgt 60 
ttaaataact cttctgggaa gttggtttat aagcctttgc caggtgtaac tgttgtgaaa 120 
tacccaccac taaagttttt taagttccat attttctcca ttttgccttc ttatgtattt 180 
tcaagattat tctgtgcact ttaaatttac ttaacttacc ataaatgcag tgtgactttt 240 
cccacacact ggattgtgag gctcttaact tcttaaaagt ataatggcat cttgtgaatc 300 
ctataagcag tctttatgtc tcttaacatt cacaectact ttttaaaaac aaatattatt 360 
actattttta ttattgtttg^ tcctttataa attttcttaa agattaagaa aatttaagac 420 
cccattgagt tactgtaatg caattcaact ttgagttatc ttttaaatat gtcttgtata 480 
gttcatattc atggctgaaa cttgaccaca ctattgctga ttgtatggtt ttcacctgga 540 
caccgtgtag aatgcttgat tacttgtact cttcttatgc taatatgctc tgggctggag 600 
aaatgaaatc ctcaagccat caggatttgc tatttaagtg gcttgacaac tgggccacca 660 
aagaacttga acttcacctt ttaggatttg agctgttctg gaacacattg ctgcactttg 720 
gaaagtcaaa^ atcaagtgcc agtggcgccc tttccataga gaatttgccc agctttgctt 780 
taaaagatgt* cttgtttttt atatacacat aatcaatagg tccaatctgc tctcaaggcc 840 
ttggtcctgg tgggattcct tcaccaatta ctttaattaa aaatggctgc aactgtaaga 900 
acccttgtct gatatatttg caactatgct cccatttaca aatgtacctt ctaatgctca 960 
gttgccaggt tccaatgcaa aggtggcgtg gactcccttt gtgtgggtgg ggtttgtgggl020 
tagtggtgaa ggaccgatat cagaaaaatg ccttcaagtg tactaattta ttaataaacal080 
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ttaggtgttt gttaaaaaaa aaaaaaaaaa aaaaaaaaag; ggtgggtggt Qttggttgtall40 
ttggtt 114 6 

(2) INFORMATION OBERSEQ ID NO: 6: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2407 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: ekizel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHRE1BUNG: SEQ ID NO: 6 

gagtgagtga gtgtgttgca tcgaattaag gactcttgaa gagaagagag gtccattcag 60 
ggttgtccag attgaagtga ggtctcacgg tgaaaagaaa aggaaaatat tcagactctc 120 
ttgaaatcca aagagcaaga agtaaatgaa cttctgcaaa aattccagca agctcaggaa 180 
gaacttgcag aaatgaaaag ataqtctgag agctcttcaa aactggagga agataaagat 240 
aaaaagataa atgagatgtc gaaggaagtc accaaattga aggaggcctt gaacagcctc 300 
tcccagctct cctactcaac aagctpatcc aaaaggcaga gtcagcagct ggaggcgctg 360 
cagcagcaag tcaaacagct ccagaaccag ctggcggaat gcaagaaaca acaccaggag 420 
gtcatatcag tttacagaat gcatcttctg tatgctgtgc agggccagat ggatgaagat 480 
gtccagaaag tactgaagca aatccttacc atgtgtaaaa accagtctca aaagaagtaa 540 
agtggattcc ttggcaggac actgcccctt gtcatctgtc tttgtgttag atccagagtt 600 
gtcggcagcc gctgccattg ttctcattcg tggtatgcac tgtggcctag cgtagttctt 660 
ccctttccaa aggtttctga ggacttctcc caggagaaga ctgcccgcct cagaactgct 720 
tagagacttc aaaccagcag aggtgaaagt ccctgtcatc ccttcagatt ccagagctgg 780 
gatcagccat gcccagaggt ctggtcctga tgctggcagg ggggccccct cctccatccc 840 
tgactggctg agtggcttta tcaccaccga gtgatgtgct gaggcctcct gcagtgaatg 900 
ctccttccat tcctgtactc gggcagtgcc attcagcaca ggagagctct ttttgccttt 960 
ggctttcaat tccaaaacat gatttaattt ctaactaaat tagtatggca ctagttatgal020 
agtatctgct taaaaccct;t catcatgata tcctgtggat ttaaaaactc taattccatgl080 
ttttcttccc atctgcctta tatatctcat caccctgctt atcaatattc agtttgatgall40 
gcactattaa ctaaaatatg aaacttaaaa acaaaagcaa gttgtcctta aaagttctttl200 
ttttaagtaa attgttgaca tactgcaaat tttctatgca aacttgcctc ctgctgttatl260 
ctgtgaagct caggaaatcc aaacatttgt gtttcaacaa gggacagtaa actgtgtgttl320 
tacagccaaa agaaatgcct catagttctt aacctcaact tttgtagaag tatttttttcl380 
tctgtaatat ttttattggc tcataaagat gttttcatat ctgaactcct aaataagtgal440 
aattacagta gattatatta acaaaatact ttttaggtag ccatgcttga gactttttaalSOO 
aaatataact ttttccttaa agttttcagc tatagcaaaa ggtagttatg tatgccagacl560 
ctaatatgag ctgccaccaa cacccctaga actttcagcc atggtgtctt cagaattgtal620 
gcgcatttct gaatctagca aatcctcctt ttacccgttg aatgttttga atgccctgacl680 
tctaccagcg cccataaatg atctctagaa ggactgttag taccaatctg tttttcaactl740 
ttgaagctaa aaaccctgat atggtaatat tatggtgcat agcagaggtc tcggaaaaaal800 
aatatttctg ttcactttac tttcaggtta aaaatgtttc taacacgctt gcaacttcccl860 
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ttatggcatt aatcttgttg agggagagag acagaatcct ggactctcca aa^tatttca!923 
ctgaaagtag ggcctgctct gacagggccp atgtcccacffciggctgcttg gcctcagtygl330 
gtgcttggct gtgctggatg atatgttgat ctgtattgga taaggaccaa tgacagcaaa2040 
gcaaaaatgg ctttaaagct tggtgttact tttcttaagt tgtttaatta tagttaagca2100 
atttcaaaaa tgctccaaag aaatgtgaaa ggaccttttg tcacagcact tcagaaaata2160 
cacaacagcc ccttctgccc ccgcacagaa atgctgcaga gtatataaaa cttgagacat2220 
ttttgtagga tgcctgacga ggtgtagcct tttatcttgt ttccggatgc atatttatta2280 
cgagtactct ggttaaatat tgaaaagtta tatgctgtag tttttagtat tttgtctttg2340 
taatttacag aagttattgg agaaaataaa cttgtttcat tttgcaaaaa aaaaaaaaaa2400 
atgaaaa 2407 

(2) INFORMATION UBER SEQ ID NO: 7: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1471 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte parh'elle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 7 

ctcgtgcaac ccggcggctc ctgcagcggt ggtcggctgt tgggtgtgga gtttcccagc 60 
gcccctcggg tccgaccctt tgagcgttct gctccggcgc cagctacctc gctcctcggc 120 
gccatgacca caaccaccac cttcaaggga gtcgacccca acagcaggaa tagctcccga 180 
gttttgcggc ctccaggtgg tggatccaat ttttcattag gttttgatga accaacagaa 240 
caacctgtga ggaagaacaa aatggcctct aatatctttg ggacacctga agaaaatcaa 300 
gcttcttggg ccaagtcagc aggtgccaag tctagtggtg gcagggaaga cttggagtca 360 
tctggactgc agagaaggaa ctcctctgaa gcaagctccg gagacttctt agatctgaag 420 
ggagaaggtg atattcatga aaatgtggac acagacttgc caggcagcct ggggcagagt 480 
gaagagaagc ccgtgcctgc tgcgcctgtg cccagcccgg tggccccggc cccagtgcca 540 
tccagaagaa atccccctgg cggcaagtcc agcctcgtct tgggttagct ctgactgtcc 600 
tgaacgctgt cgttctgtct gtttcctcca tgcttgtgaa ctgcacaact tgagcctgac 660 
tgtacatctc ttggatttgt> ttcattaaaa agaagcactt tatgtactgc tgtctttttt 720 
ttttttcttt tgaagaacag gtttctctct gtccttgact cttgggtctg tgggccatgg 780 
catgagtgtt ttctagtagt agattggagg gaaagctttg tgacacttag tactgtgttt 840 
ttaagaagaa ataatttggt tccagatgtg ttagaggatc ttttgtactg aggtttttaa 900 
cactttactt gggtttacca agcctcaact ggacagacca taaacagtcc acaggcaccg 960 
ttcctgccag gccccaaccc acagggagtc tctccgcaga gccttcttgg tgttgccctal020 
acttgccagt ggcctttgct cagagcctcc tcctgtgaca tgtgaacaat gaagaggcctl080 
gcgcctcctg ccttgccgcc tgcaaagcaa agaaactgcc ttttattttt taaccttaaall40 
aagtagccag atagtaacaa gactggctgg ctgatgagca aagcctttgrc tctcacgcagl200 
aggaaggctt ggatgtacaa tgaaactgcc tggaactaaa agcagtgaag caagggaggcl260 
aatcacactg aagcgggtct tcctccagga acggggtccc acaggcgtgt tgttttaaatl320 
aacctgatgc tgtgtgcatg atgctggtgc ttgaccatga aaggaaagtc tcatccttaal380 
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aatgtgttgt acttcacaat cctggactgt tgctrcaagt eaaccatatc cacattttga!440 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a ' 1471 

(2) INFORMATION UBER SEQ ID NO: 8: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1732 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte parBelle cDNA 

(ill) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 8 

gcagaaccta cgcctgacgg gcccggcggc ggctgagccg cgctgcgcag cgacgcggga 60 
atgaagcggg cgctgggcag gcgaaagggc gtgtggttgc gcctgaggaa gatacttttc 120 
tgtgttttgg ggttgtacat tgccattcca tttctcatca aactatgtcc tggaatacag 180 
gccaaactga ttttcttgaa tttcgtaaga gttccctatt tcattgattt gaaaaaacca 240 
caggatcaag gtttgaatca cacgtgtaac tactacctgc agccagagga agacgtgacc 300 
attggagtct ggcacaccgt ccctgcagtc tggtggaaga acgcccaagg caaagaccag 360 
atgtggtatg aggatgcctt ggcttccagc caccctatca ttctgtacct gcatgggaac 420 
gcaggtacca gaggaggcga ccaccgcgtg gagctttaca aggtgctgag ttcccttggt 480 
taccatgtgg tcacctttga ctacagaggt tggggtgact cagtgggaac gccatctgag 540 
cggggcatga cctatgacgc actccacgtt tttgactgga tcaaagcaag aagtggtgac 600 
aaccccgtgt acatctgggg ccactctctg ggcactggcg tggcgacaaa tctggtgcgg 660 
cgcctctgtg agcgagagac gcctccagat gcccttatat tggaatctcc attcactaat 720 
atccgtgaag aagctaagag ccatccattt tcagtgatat atcgatactt ccctgggttt 780 
gactggttct tccttgatcc tattacaagt agtggaatta aatttgcaaa tgatgaaaac 840 
gtgaagcaca tctcctgtcc cctgctcatc ctgcacgctg aggacgaccc ggtggtgccc 900 
ttccagcttg gcagaaagct ctatagcatc gccgcaccag ctcgaagctt ccgagatttc 960 
aaagttcagt ttgtgccctt tcattcagac cttggctaca ggcacaaata catttacaagl020 
agccctgagc tgccacgga.t actgagggaa ttcctgggga agtcggagcc tgagcaccagl080 
cactgagcct ggccgtggga aggaagcatg aagacctctg ccctcctccc gttttcctccl!40 
agtcagcagc ccggtatccb gaagccccgg ggggccggca cctgcaatgc tcaggagcccl200 
agctcgcacc tggagagcac ctcagatccc aggcggggag gcccctgcag gcctgcagtgl260 
cccggaggcc tgagcatggc tgtgtggaaa gcgtgggtgg caggcatgtg gctctccttgl320 
ccgcccctoa acctgagatc ttgttgggag acttaatggc agcaggcagc catcactgccl380 
tggttgatgc tgcactgagc tggacagggg gagtccgggc aggggactct tggggctcgg!440 
gaccatgctg agctttttgg caccacccac agagaacgtg gggtccaggt tctttctgcalSOO 
ccttcccagc acatgcagaa tgactccagt ggttccatcg tcccctc'ctg ccctgtgtacl560 
ctgcttgcct ttctcagctg ccccacctcc cctgggctgg cccactcacc cacagtggaa!620 
gtgcccggga tctgcacttc ctcccctttc acctacctgt acacctaacc tggccttagal680 
ctgagcttta tttaagaata aaatcgtggt ggtggtcaaa aagacactct gc 1732 
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(2) INFORMATION UBER SEQ ID NO: 9: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 989 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 



(iii) HYPOTHETISCH: NEIN 
i) ANTI-SENSE: NEIN 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 9 

cggctcgagc gtgatcgtcg actcagctga ccctgcggga ccggaaaqag aaattcccgg 60 
gccctggctt cttggcgcga tgaggttccg gttctgtggt gatctggact gtcccgactg!20 
ggtcctggca gaaatcagca cgctggccaa gatgtcctct gtgaagttgc ggctgctctg!80 
caccaggtac taaaggagct gctgggacag gggattgatt atgagaagat cctgaagctc240 
acggctgacg ccaagtttga gtcaggcgat gtgaaggcca cagtggcagt gctgagtttc300 
atcctctcca gtgcggccaa gcacagtgtc gatggcgaat ccttgtccag tgaactgcag360 
cagctggggc tgcccaaaga gcacgcggcc agcctgtgcc gctgttatga ggagaagcaa420 
agccccttgc agaagcactt gcgggtctgc agcctacgca tgaataggtt ggcaggtgtg480 
ggctggcggg tggactacac cctgagctcc agcctgctgc aatccgtgga agagcccatg540 
gtgcacctgc ggctggaggt ggcagctgcc ccagggaccc cagcccagcc tgttgccatg600 
tccctctcag cagacaagtt ccaggtcctc ctggcagaac tgaagcaggc ccagaccctg660 
atgagctccc tgggctgagg agaagggtgt tccaggcctg tgtggagccg ccctgcccgt720 
atggagtcac gccctctgaa ctgctcttcg ggaggcagcc ctggttctag gatgctgagg780 
ccctggcccg gactctggcc tcccagatcc ccagctgcct cacttctctc ttgagaactt840 
ggctcagggc tcctgaggac ctttcccagc attaccttcc cttcccttga aaggcaattg900 
ttggctgttt tcataagcag gaaaaataaa cagaagtata aaagagaaaa* aaaaaaaaaa960 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 989 

(2) INFORMATION UBER SEQ ID NO: 10: 

(i) SEQUENZ CHARAKTERISTIK: : 

(A) LANGE: 150 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
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(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 10 

attttatgaa gttgaaaaat agctcacttt aaagctagtt ttgaagacgt gcagctgtga 60 
cttgggtctg gttgggggtg ttgtgttttg agtcagccgt tttcactccc actgaggttgl20 
tcagaacatg cagattgctt cgattttctc 150 



(2) INFORMATION UBER SEQ ID NO: 11: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1467 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEK0LTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 11 



cgaggaccgg ccttgcgagc ggcgacgact ataaaatggc gcgtgctgca acccgcgccc 60 
gcttcggaga gagaaatgct ggggtgcagc tticaagctta ggaccaccca ccatgcctat 120 
ccaggtgctg aagggcctga- ccatcactca ttaagaacag aggaggctgc ctgttactcc 180 
tggtgttgca tccctccaga cactctgctg tttcctgcct aggcgtggct gcagccatgg 240 
ctaggaaagc gctgccaccc acccacctgg gccagagctg gttctgctcc tgctgcaggg 300 
acactgagct ggctatctcg gcgcttcggg caagaactgc aacaggctct cctgggtcct 360 
gcaggtgtac agccgggccc ctgccttgtg cctcagctct cgagagctgc tgctgccggg 420 
tgacctgatc caacctgata aggtgccatc ttcagctacc actgcaaggc cctgagggca 480 
acagcagcac ggcactgccc acccggctgc tgatggcctg gtgccagctg ggagtcctcc 540 
cggcacttcg aggccactga gccacccttc cagccccagc ccaccatgga caggggtatc 600 
cagcttcctc ctcaacctcg tcctctgccc ctgagccagt gacgcccaag gacatgcctg 660 
ttacccaggt cctgtaccag cactagctgg tcaagggcat gacagtgctg gaggccgtct 720 
tggagatcca ggccatcact ggcagcaggc tgctctccat ggtgccaggg cccgccaggc 780 
caccaggctc atgctgggac ccaacccagt gcacaaggac ttggctgctg agccacacac 840 
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ccaggagaag gtggataagt gggctaccaa 
ccccgcttcc ctattgtgac caggcctatg 
cctgggcctg gctctcaagg acagacaccg 
ccagaatcct gggggagctg ctcctggttt 
aggggaggca aaaagccttg ggcactaccc 
ctagccacac cctgacacca tgttcaaggg 
ctcccctgtg ggtgtcactg gccagatgtc 
tgaccatgag tccctggggg gagtgatccc 
cccaggcagc agggtgggtg ggtaccatgg 
aagcactgca ggccaagcag ggcaacccca 
ttaaaaaaaa aaaaaaaaaa aaaataa 



gggcttcstg caggctaggg gcggagccac .900 
gggaggagct" gtccalacgc caccgtgaga SCO 
cctggcctgg tgctccaggg gtgaagcaggl020 
gagctgcatt caggaagtgc gggacatggtl080 
tccctgtgga gctgttcggt gtccgtcgagll40 
taccggaaga gaagggtgtc tgcccccaac!200 
atgagggaag caggccttgt gagtggacacl260 
ccaggcatcg tgtgccatgt tgcacttctgl320 
gtgcccaccc ctccaccaca tggggccccal380 
cacccttgac ataaaagcat cttgaagcttl440 

1467 



(2) INFORMATION UBER SEQ ID NO: 12: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 895 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte parBelle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 12 



ctcgagccgc tcgagaacct cacttcctta ctcctccaaa aagaagtggg gaaagaacca 60 
tcaaaccttt cctcctgact taccaaacca ggaaaacagc aggagagggt ggctcaggacl20 
ttagggacag ggtatagctt agatggtgga aagcaaagga gagcaggaag ttgtaaatcal80 
ctggctaatg agaaaaggag acagctaact ctaggatgaa gctgtgacta ggctggagtt240 
gcttccttga agatgggact ccttgggtat caagacctat gccacatcac actggggcta300 
gggaagtagg tgatgccagc cctcaagtct gtcttcagcc agggacttga gaagttatat360 
tgggcagtgg ctccaatctg tggaccagta tttcagcttt ccctgaagat caggcagggt420 
gccattcatt gtctttctct cctagccccc tcaggaaaga aggactatat ttgtactgta480 
ccctaggggt tctggaaggg aaaacatgga atcaggattc tatagactga taggccctat540 
ccacaagggc catgactggg aaaaggtatg ggagcagaag gagaattggg attttagggt600 
gcagtacgct caccctaaao ttttggtggc ctggggcatg tcttgaggcc cagactgtta660 
agcaggctct gctggcctgt ttactcgtca ccacctctgc acctgctgtc ttgagactcc720 
atccagcccc aggcacgcca cctgctcctg agcctccact atctccctgt gacgggtgaa780 
cttcgtgtac tgtgtctcgg gtccatatat gaattgtgag cagggttcat ctattttaaa840 
cacagatgtt tacaaaataa agattatttc aaaccaaaaa gaaaaaaaaa aaaaa 895 



(2) INFORMATION UBER SEQ ID NO: 13: 



(i) SEQUENZ CHARAKTERISTIK: 
(A) LANGE: 467 Basenpaare 
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(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editieaing 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 13 

acccagcagc cctcgcgcgg tccggcacag cggacaccag gactccaaaa tggcgtcagt 60 
tgtaccagtg aaggacaaga aacttctgga ggtcaaactg ggggagctgc caagctggatl20 
cttgatgcgg gacttcagtc ctagtggcat tttcggagcg tttcaaagag gttactaccgl80 
gtactacaac aagtacatca atgtgaagaa ggggagcatc tcggggatta ccatggtgct240 
ggcatgctac gtgctcttta gctactcctt ttcctacaag catctcaagc acgagcggct300 
ccgcaaatac cactgaagag gacacactct' gcaccccccc accccacgac cttggcccga360 
gcccctccgt gaggaacaca atctcaatcg ttgctgaatc ctttcatatc ctaataggaa420 
ttaacctcca aataaaacat gactggtacg tgtaaaaaaa aaaaaaa 467 



(2) INFORMATION UBER SEQ ID NO: 14: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 511 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 14 
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actagttcta gatcgcgagc 
tctttttttt ttttttgtgg 
gggaggagag cgtgtgcggc 
ctgcctctgg cttctcaggc 
agctgcagcc catcctcccg 
tcagagacaa cttcccaaag 
agactctgta cctattttgt 
gctggaataa agcatgtgga 
aaaaaaaaaa aaaaaaaaaa 



ggccgccctt tttgttcaaa 
atggggactt gtgaattttt 
tccagcccag cccgctgctc 
ctctgctctc cgacctctct 
gcgccctcct agtctgtcct 
cacaaagcag tttttccccc 
atgtgtataa taatttgaga 
aatgacccca aacataaaaa 
aaaaaaaaaa a 



gtctattttt attctctttt 60 
ctaaaggtgc tatttaacatl20 
actttccacc ctctctccacl80 
cctctgaaac cctcctccac240 
gcgtcctctg tccccgggtt300 
taggggtggg aggaagcaaa360 
tgtttttaat tattttgatt420 
aaaaaaaaaa aaaaaaaaaa480 

511 



(2) INFORMATION UBER SEQ ID NO: 15: 

(i) SEQUENZ CHARACTERISTIC 

(A) LANGE: 1899 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblieaing und Editierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 
(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 
(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 15 

tctccaccct ccccttcctt ctttcttttt ccccttcttg cacgtggatc 
cagaatgagg ctgctttatt ggaagctatt ctgacatcac tttccagact 
cttgggacca ggcatgggag .gcaggggtgg gaatcttctt gtgattgtgg 
gaggagtgga gtggtgggag gcggctcagt cgcgggcact cctgcgactg 
tccggatgga ataagccttc aggagcccag gacccgcact gctggagaag 
tgctgcccat ggttcccccg agggtcagcc caatgccacc gccactgcta 
tggaattcat cacagagata ttcacggctc ccactccatc tccagccaac 
cgccctccag cagcttgcgg taggtggcga tctcgatgtc cagggccagc 
tgagttcctg gtactcacgc agctgccgtg ccatatcctg cttggcccgc 
cttccagctc ctcctgcttg. gcacgagcat ccttgagcgc cagctcccca 
cctcggcaat ggcggcctcc aacttggcac gctggttctt gatgttgtcg 
gcagcctctg gatggcccgg ttcatctctg aaatctcatt ccgggtattc 
ccccatgctt cccagcctgg gcctggaggg tctcaaactt ggtctggtac 
cctcagcccg gctgcatttg gccatctcct catactgcgc cttgacctca 
cgtccaggtc cagggagcga ctgttgtcca tggacagcac cacagatgtg 
gggactgcag ctctgtcaac tccgtctcat tgagggtcct gaggaagttg 
tcagggcatc caccttggcc tccagctcca ccttgctcat gtaggcagca 
tcttcagcac cacaaactca. ttctcagcag ctgtgcggcg gttaatttca 
tattcttgaa gtcctccacc acatcctgcc atctccgcag ctccgcctcc 
catccacctg cagtgcctca agctgacccc gaaggccagc aatctgggcc 
ctgggaggcg gctgctcttg gccgacttct gctcctgcag cagcgtccac 



actcaggcct 60 
gtctcactgt 120 
gtggtggctg 180 
gcggatgcgg 240 
ctcagggcat 300 
ctgccaccag 360 
cggctctcct 420 
ttcacgctca 480 
tgcagggcgg 540 
cgctcctcag 600 
atctcagcct 660 
cggaggtcgt 720 
caggcttcag 780 
gcgatgatgc 840 
tcggagatct 900 
atctcatcat 960 
tccacatcctl020 
tcttcgtactl080 
aggcggccccl 14 0 
tcaaagatgtl200 
ttggtctccal260 
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gcagcttgtt ctgctgctcc agaaaccgca ccttqtcgat .gaaguaggoa aacttgttgt3 320 
tgagggtctt gatctgctcg ctctcctcct ggcgcacccg'ctggagggag gggtcggcgtl380 
ccagccgcag cggggccagc aggctctggt taatggtgac ctcgcggatg ccggcgcccal440 
ccgggccccc ataggcagag cgcacggcca cgcgcggccg tgaggcgccg aggccgtagalSOO 
ggctgctgct gccaaggccg ccggggcgag cggagctcag gcgcacctgg gcgccgcggcl560 
ccgagaaggc ggctgagcgc gaggtgaata ccggggagct gaagtggatg gacatggtggl620 
ctgggccggg atggacctag cggcgggcga ggaggagcgc gcactcgctg acctcggggal680 
cactccgcac cttttatccg cgggagccgg tgctgggctt ccacaggtag gggcggggctl740 
ggccgcgggc accgtttctc tgctgccagg cccctcctgc gcgtccgtcc gccctctgccl800 
cgcccgcccc gccgaagccc aggctttcag tccaagcagg gatggtccgg agtaggcaggl860 
agcgccatcc ctagacggcc gcagagaaca gcgggggac 1899 

(2) INFORMATION UBER SEQ ID NO: 1 6: 

(i) SEQUENZ CHARACTERISTIC 

(A) LANGE: 758 Basenpaare 

(B) 7YP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEK0LTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 16 

cggctcgagc ggctcgaggg gcaagaagaa catcacgtgc tgtgacaccg acttgtgcaa 60 
cgccagcggg gcccatgccc tgcagccggc tgccgccatc . cttgcgctgc tccctgcactl20 
cggcctgctg ctctggggac ccggccagct ataggctctg gggggccccg ctgcagcccal80 
cactgggtgt ggtgccccag gcctctgtgc cactcctcac agacctggcc cagtgggagc240 
ctgtcctggt tcctgaggca catcctaacg caagtctgac catgtatgtc tgcacccctg300 
tgccccaccc tgaccctccc atggccctct cbaggactcc cacccggcag atcagctcta360 
gtgacacaga tccgcctgca gatggcccct ccaaccctct ctgctgctgt ttccatggcc420 
cagcattctc cacccttaac cctgtgctca ggcacctctt cccccaggaa gccttccctg480 
cccaccccat ctatgacttg agccaggtct ggtccgtggt gtcccccgca cccagcaggg540 
gacaggcact caggagggcc cagtaaaggc tgagatgaag. tggactgagt agaactggag600 
gacaagagtc gacgtgagtt cctgggagtc tccagagatg gggcctggag gcctggagga660 
aggggccagg cctcacattc gtggggctcc ctgaatggca gcctgagcac agcgtaggcc720 
cttaataaac acctgttgga taagccaaaa aaaaaaaa 758 



(2) INFORMATION UBER SEQ ID NO: 17: 

(i) SEQUENZ CHARACTERISTIC 

(A) LANGE: 302 Basenpaare 

(B) TYP: Nukleinsaure 
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(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHET1SCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 17 

cggctcgacg gtctcgagat caagcatgaa ttgcaagcaa actgctacga ggaggtcaag 60 
gaccgctgca ccctggcaga gaagctgggg ggcagtgccg tcatctccct ggagggcaagl20 
cctttgtgag cccctttctg cgcccccttg cctgggagca tctgggcagg ccccaacaccl80 
ttgccctttg ggggtttgca gggctcgccc cctttcctgg ccagaaccgg gagggggctg240 
gqgggggatt cccaggcagg gggggagggg ccaattocct tttcaaccce caggttgggc300 
ca 302 



(2) INFORMATION UBER SEQ ID NO: 18: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 824 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: - 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 18 

ggcgaggaca gaggaggcgc gtccggcctt cctgaacacc ttaggctggt ggggctgcgg 60 
caagaagcgg gtctgtttct ttacttcctc cacggagtcg gcacactatg gctgcccctgl20 
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ggctcccaga acccacaaca tgaaagaaat 
aatccggaca caaaaccctc tagcttggaa 
ctacctgact caggaatcgg ctctggaagg 
caacatcaaa gacaccatcg gaacagcagc 
ccatcttcat ggccaccccc tgcggcggac 
agctgagctc ctccagcggg atgacgccgt 
ccttctgtct ctttgtttct gagctttcct 
ctccacgact ctgtttcccc cgtcccttct 
ggttggagtt gtggagacgg ccttgagtct 
ttggcaagtg cctgtgcagg gcccggccgc 
ggctgtgtgc ccgaaggcct caccctggcc 
,cgaacatcaa gggaggcaag cctttcaagg 



ggtgccaccc agctcaagnc tgggcctttg3 80 
atgaatatgc tgcectttac aaccactgca2'.0 
tgaagctaga ggaaccagac ctcatcagcc300 
gcccgcagca cccaccccgc accggcgact360 
ggttgaccac cagccaccac atcatcccag420 
ccccaccacc tccctcttct tctttttcat480 
gtctttcctt ttttctgaga gattcaaagc540 
gaatttaatt tgcactaagt catttgcact600 
cagtacgagt gtgcgtgagt gtgagccacc660 
cctccatctg ggccgggtga ctggggcgcc720 
cttcggcctt agtctgggaa ggttccgaac780 
catttccatt aatt 824 



(2) INFORMATION OBERSEQ ID NO: 19: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2190 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 19 

tttttttttt tttttttttt tttttttttt tttttttttt tttcatgact tttaatgctt 60 
tattgggatt gcaagcgtta caaggttaaa gacaaaaccc aagcatggga ttttgccgga 120 
aatattagcg ttaaaggagc tgagttgagt caaacacggg ccgcaaggtg gaccgaggcg 180 
..gcaggcacag gtgacattca gtgtttggcg tgggggtctt caggtgatgg cagaggaggg 240 
gacccaagag ggggcccccc actgaagaca ttggggacac ggggaggaga caagatggag 300 
agccacgact aggcacggag gtcagacagg cagcccgggc caggatggtt agtggcccag 360 
gggagagctg caaacctggg gacgcaaggg gctggtcggc aagtgccccc gggaacaccc 420 
actccggcga ggcagaata^ aacactgggt gggtgggtgt cctgacgaat gggcaggtaa 480 
tttggggtgc ctcgaagcgt tttggatctc aggccaatgt gggttccaca attgtgacaa 540 
tttggctctt tgggcttctg tccaatgttc cgaatggccc actcacaggg cgcttgccga 600 
gggaccctrdt gcgacgctcg agctcgagcc gaaatgaggg aacccccaaa tttcatgtca 660 
attgatctat tccccctctt tgtttcttgg ggcagttttt tttttacccc tccttagctt 720 
tatgcgctca gaaaccaaat taaacccccc ccccatgtaa caggggggca gtgacaaaag 780 
caagaacgca cgaagccagc ctggagacca ccacgtcctg ccccccgcca tttatcgccc 840 
tgattggatt ttgtttttca tctgtccctg ttgcttgggt . tgagttgagg gtggagcctc 900 
ctggggggca ctggccactg agcccccttg gagaagtcag aggggagtgg agaaggccac 960 
tgtccggcct ggcttctggg. gacagtggct ggtccccaga agtcctgagg gcggagggggl020 
gggttgggca gggtctcctc aggtgtcagg agggtgctcg gaggccacag gagggggctcl080 
ctggctggcc tgaggctggc cggaggggaa ggggctagca ggtgtgtaaa cagagggttcll40 
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catcaggctg gggcagggtg gccgccttcc gcacftcttga ggaaccctc-.c eccctccctci200 
ggtgacatct tgcccgcccc tcagcaccct gcctcgcctc caggaggtcc gaagctctgtl260 
gggacctctt gggggcaagg tggggtgagg ccggggagta gggaggtcag gcgggtctgal320 
gcccacagag caggagagct gccaggtctg cccatcgacc aggttgcttg ggccccggagl380 
cccacgggtc tggtgatgcc atagcagcca ccaccgcggc gcctagggct gcggcagggal440 
ctcggcctct gggaggttta cctcgccccc acttgtgccc ccagctcagc ccccctgcaclSOO 
gcagcccgac tagcagtcta gaggcctgag gcttctgggt cctggtgacg gggctggcatl560 
gaccccgggg gtcgtccatg ccagtccgcc. tcagtcgcag agggtccctc ggcaagcgccl620 
ctgtgagtgg. gccattcgga acattggaca gaagcccaaa gagccaaatt gtcacaattgl680 
tggaacccac attggcctga gatccaaaac gcttcgaggc accccaaatt acctgcccatl740 
tcgtcaggac acccacccac ccagtgttat attctgcctc gccggagtgg gtgttcccggl800 
gggcacttgc cgaccagccc cttgcgtccc caggtttgca gctctcccct gggccactaal860 
ccatcctggc ccgggctgcc tgtctgacct ccgtgcctag tcgtggctct ccatcttgtcl920 
tcctccccgt gtccccaatg tcttcagtgg ggggccccct pttgggtccc ctcctctgccl980 
atcacctgaa gacccccacg ccaaacactg aatgtcacct gtgcctgccg cctcggtcca2040 
ccttgcggcc cgtgtttgac tcaactcagc tcctttaacg ctaatatttc cggcaaaatc2100 
ccatgcttgg gttttgtctt taaccttgta acgcttgcaa tcccaataaa gcattaaaag2160 
tcatgaaatt caaaaaaaaa aaaaaaaaaa 2190 



(2) INFORMATION UBER SEQ ID NO: 20: 

(i) SEQUENZ CHARAKTERIST1K: 

(A) LANGE: 2565 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus elnzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 20 

ctcccccacc tgtggcccgc aagccgtctg tgggagtccc ggaccccgcc tcccccagtt 60 
accctcgagc tgagcccctt actgctcctc ccaccaatgg gctccctcac acccaggaca 120 
ggactaagag ggagctggcg^ gagaatggag gtgtcctgca gctggtgggc ccagaggaga 180 
agatgggcct cccgggctca gactcacaga aagagctggc ctgaccacca ggcacctcac 240 
tggcactgct gacccatccc agaaacacaa tctcagggac ccgagcagct ccaaggacga 300 
gaggataoag cagacacaac ctaatagaga gggcgcctgc agccttaacc tccacggcct 360 
tcgatactta tgcaagcctg gtgttgctcc tgtcctcaga gtcatcctgc gctcatgcct 420 
tttcccgaat gggttcacct ctggcagttg ccgcttcagt cttggcctta gcctcatctt 480 
gaagtgggta gctggcggga gagggtggct gcgccccctg ctggccctga ggctgcagag 540 
ttgggagcag gacacctcac ctgagtttca ttttttttca tgtccaaacc atgcacatac 600 
tatagtccag aatcaaagca cttttgaaaa gtggctgcat ggccatcctc cagggcccag 660 
gaagttgcat tccaagggcc tgtttacatg gcagcagaat ccatccccgg cagtcagccc 720 
atagcttggg accagtctgt gccctcctgc ccagtccagt ttactcctct tggttcctga 780 
aggtggccaa gtcattgtgt tcccacaggc ttctctaggc tgggggcagg tgtggggctg 840 
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tggaattcca aagcacaaaa ggtgcagagg ggattggcct tcctgtccct caactcacca 900 
accaccctcc tgccttccag ttctgccagg tgctccatgc -tg^ggacaag taggagactg 960 
ccagggccca aagaaatggg tgagcagtag agtcatctcg gggcacttgg cagtgtcaagl020 
cacctgcccc ttgcctcctt gaccacactg gggtgggtgg gcccccagca cttcagaggclO80 
aggagccttt gggctgagca agcactgagg aggtggatgg aagggagcat ctggagggggll40 
ggagcttcct tgagcagtgg gcccaggcct ggccctccac acttcattct ctgacctttcl200 
tctctcctca tttcggtgca tgtcctttct gcagctgcct ttcagcacag gtggttccacl260 
tgggggcagc taacgctgag tgacaaggat gggaagccac aggtgcattt tactcaagtcl320 
ttctctagtc aatgaggggc acccagtgct tctagggcag gctgggtggt ggtcccctagl380 
gtatcagcct ctcttactgt actctccggg aatgttaacc tttctatttt cagcctgtgcl440 
cacctgtcta ggcaagctgg cttccccatt ggcccctgtg ggtccacagc agcgtggctglSOO 
ccccccaggg ccaccgcttc tttcttgatc ctctttcctt aacagtgact tgggcttgagl560 
tctggcaagg aaccttgctt ttagcttcac caccaaggag agaggttgac atgacctcccl620 
cgccccctca ;ccaaggctgg gaacagaggg gatgtggtga gagccaggtt cctctggcccl680 
tctccagggt gttttccact agtcactact gtcttctcct tgtagctaat caatcaatat!740 
tcttcccttg cctgtgggca gtggagagtg ctgctgggtg tacgctgcac ctgcccactgl800 
agttggggaa agaggataat cagtgagcac tgttctgctc agagctcctg atctaccccal860 
ccccctagga tccaggactg ggtcaaagct gcatgaaacc aggccctggc agcaacctggl920 
gaatggctgg aggtgggaga gaacctgact tctctttccc tctccctcct ccaacattacl980 
tggaactcta tcctgttagg atcttctigag cttgtttccc tgctgggtgg gacagaggac2040 
aaaggagaag ggagggtcta gaagaggcag cccttctttg tcctctgggg taaatgagct2100 
tgacctagag taaatggaga gaccaaaagc ctctgatttt taatttccat aaaatgttag2160 
aagtatatat atacatatat atatttcttt aaatttttga gtctttgata tgtctaaaaa2220 
tccattccct ctgccctgaa gcctgagtga gacacatgaa gaaaactgtg tttcatttaa2280 
agatgttaat taaatgattg aaacttgaaa aaggctactg cttcttaatg ttggggggac2340 
agggcagtgg tctgggccca catttagaag ggaaaatgtt ttgcctgctg cacacattgg2400 
acccaagtat gggcctcttc tgcctagtac tgccaaaggg actgttaagg tgtcttgtcc2460 
atcttctacc ccccaccccc cattacaggg taaagggaac cccagactag gtgaggggcc2520 
agcagctgcc tcacacttgt gttctctcct gagatggtcc agctt 2565 



(2) INFORMATION UBER SEQ ID NO: 21 : 

(i) SEQUENZ CHARAKTERIST1K: 

(A) LANGE: 461 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEK0LTYP: aus einzelnen ESTs durch Assemblieaing und Editieaing 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 21 

gttcctccct agcggctcgc tcagaagcag agttctgggg tgtctccacc atggcctgga 60 
cccctctctg gctcactctc ctcactcttt gcataggttc tgtggtttct tctgagctgal20 
ctcaggaccc tgctgtgtct gtggccttgg gacagacagt caggatcaca tgccaaggagl80 
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acagcctcag aagctattat gcaagctggt accagcagaa gccagcacag gcccctgtae240 
ttgtcatcta tggtaaaaac aaccggccct cagggatccc • agaccgat to tctggct-cca300 
gctcaggaaa cacagcttcc ttgaccatca ctggggctca ggcggaagat gaggctgact360 
attacttgta aactccccgg gacagcaagt gggtaaccaa tgtgggtatt ccggcgggag420 
ggacccaagc ttgacccgtt cttaaggtca gcccaaaggg c 461 

(2) INFORMATION OBER SEQ ID NO: 22: 

(i) SEQUENZ CH ARAKTfe RISTI K: 

(A) LANGE: 2096 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(II) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 
hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 

(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 22 

atcccggtca cttttggtta caggacgtgg cagctggttg gacgagggga gctggtgggc 60 
agggtttgat cccagggcct. gggcaacgga ggtgtagctg gcagcagcgg gcaggtgagg 120 
accccatctg ccgggcaggt gagtcccttc cctccccagg cctcgcttcc ccagccttct 180 
gaaagaagga ggtttagggg atcgagggct ggcggggaga agcagacacc ctcccagcag 240 
aggggcagga tgggggcagg agagttagca aaggtgacat cttctcgggg ggagccgaga 300 
ctgcgcaagg ctggggggtt atgggcccgt tccaggcaga aagagcaaga gggcagggag 360 
ggagcacagg ggtggccagc gtagggtcca gcacgtgggg tggtacccca ggcctgggtc 420 
agacagggac atggcagggg acacaggaca gaggggtccc cagctgccac ctcacccacc 480 
gcaattcatt tagtagcagg cacaggggca gctccggcac ggctttctca ggcctatgcc 540 
ggagcctcga gggctggaga gcgggaagac aggcagtgct cggggagttg cagcaggacg 600 
tcaccaggag ggcgaacggc cacgggaggg gggccccggg acattgcgca caaaggaggc 660 
tgcaggggct cggcctgcgg gcgccggtcc cacgaggcac tgcggcccag ggtctggtgc 720 
ggagagggcc cacagtggac ttggtgacgc tgtatgccct caccgctcag cccctggggc 780 
tggcttggca gacagtacag catccagggg agtcaagggc atggggcgag accagactag 840 
gcgaggcggg cggggcgga^g tgaatgagct ctcaggaggg aggatggtgc aggcaggggt 900 
gaggagcgca gggggcggcg agcgggaggc actggcctcc agagcccgtg gccaaggcgg 960 
gcctcgcggg cggcgacggsb gccgggatcg gtgcctcagc gttcgggctg gagacgaggcl020 
caggtctcca gctggggtgg acgtgcccac cagctgccga aggcaagacg ccaggtccggl080 
tggacgtgac aagcaggaca tgacatggtc cggtgtgacg gcgaggacag aggaggcgcgll40 
tccggcctbc ctgaacacct taggctggtg gggctgcggc aagaagcggg tctgtttcttl200 
tacttcctcc acggagtcgg cacactatgg ctgccctctg ggctcccaga acccacaacal260 
tgaaagaaat ggtgctaccc agctcaagcc tgggcctttg aatccggaca caaaaccctcl320 
tagcttggaa atgaatatgc tgcactttac aaccactgca ctacctgact caggaatcggl380 
ctctggaagg tgaagctaga ggaaccagac ctcatcagcc caacatcaaa gacaccatcgl440 
gaacagcagc gcccgcagca cccaccccgc accggcgact ccatcttcat ggccaccccclSOO 
tgcggcggac ggttgaccac cagccaccac atcatcccag agctgagctc ctccagcgggl560 
atgacgccgt ccccaccacc tccctcttct tctttttcat ccttctgtct ctttgtttctl620 
gagctttcct gtctttcctt ttttctgaga gattcaaagc ctccacgact ctgtttccccl680 
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cgtcccttct gaatttaatt tgcactaagt 
ccttgagtct cagtacgagt gtgcgtgagt 
gcccggccgc cctccatctg ggccgggtga 
ccctgccctc gcctagtctg gaagctccga 
cattacgccc catctcgctc tgtgcccctc 
tcatcatcaa taaacactgt tacagcaaaa 
aaagaaaaag aaaaggaaaa aaagagaggg 



catctgcact ggttggagtt gtggagG>cqCil740 
gtgagccacc ttggsaagtg cctgtgcaggI300 
ctgggcgccg gctgtgtgcc cgaggcctcal860 
ccgacatcac ggagcagcct tcaagcattcl920 
cccaccaggg cttcagcagg agccctggacl980 
aaaaaaaaga aaaaaaaaag aaaagaaaaa2040 
aggaagaagg agaaaaggga gtgtgg 2096 



(2) INFORMATION UBER SEQ ID NO: 23: 

(i) SEQUENZ CHARAKTERISTIK: 
* (A) LANGE: 1348 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 

(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 



(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 23 

ctctcgcgag gccccagaga gcaggcgctg ggcagtgtgg aggtcgttgg agtcacttcc 60 
gcgtcaccag ctcctgtgcc tgccagtcgg tgcccctccc gctccagcca tgctctccgc 120 
cctcgcccgg cctgccagcg ctgctctccg ccgcagcttc agcacctcgg cccagaacaa 180 
tgctaaagta gctgtgctag gggcctctgg aggcatcggg cagccacttt cacttctcct 240 
gaagaacagc cccttggtga gccgcctgac cctctatgat atcgcgcaca cacccggagt 300 
ggccgcagat ctgagccaca tcgagaccaa agccgctgtg aaaggctacc tcggacctga 360 
acagctgcct gactgcctga aaggttgtga tgtggtagtt attccggctg gagtccccag 420 
aaagccaggc atgacccggg acgacctgtt caacaccaat gccacgattg tggccaccct 480 
gaccgctgcc tgtgcccagc actgcccgga. agccatgatc tgcgtcattg ccaatccggt 540 
taattccacc atccccatca cagcagaagt tttcaagaag catggagtgt acaaccccaa 600 
caaaatcttc ggcgtgacga ccctggacat cgtcagagcc aacacctttg ttgcagagct 660 
gaagggtttg gatccagctc gagtcaacgt ccctgtcatt ggtggccatg ctgggaagac 720 
catcatcccc ctgatctctcf agtgcacccc caaggtggac tttccccagg accagctgac 780 
agcactcact gggcggatcc aggaggccgg cacggaggtg gtcaaggcta aagccggagc 840 
aggctctgcc accctctcca tggcgtatgc cggcgcccgc tttgtcttct cccttgtgga 900 
tgcaatgaat ggaaaggaag gtgttgtgga atgttccttc gttaagtcac aggaaacgga 960 
atgtacctac ttctccacac cgctgctgct tgggaaaaag ggcatcgaga agaacctgggl020 
catcggcaaa gtctcctctt ttgaggagaa gatgatctcg gatgccatcc ccgagctgaal080 
ggcctccatc aagaaggggg aagatttcgt gaagaccctg aagtgagccg ctgtgacgggll40 
tggccagttt ccttaattta tgaaggcatc atgtcactgc aaagccgttg cagataaactl200 
ttgtatttta atttgctttg gtgatgatta ctgtattgac atcatcatgc cttccaaattl260 
gtgggtggct ctgtgggcgc atcaataaaa gccgtccttg attttaaaaa aaaaaaaaaal320 
aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1348 
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(2) INFORMATION UBER SEQ ID NO: 24 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 358 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTH ETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 24 

aatgggggac cggagaagaa gtacaagagc acgggtcagg ggcgggactc cgacggctcc 60 
ttcttcc'tct acagcaggct aaccgtggac aagagcaggt ggcaggaggg gaatgtcttcl20 
tcatgctccg tgatgcatga ggctctgcac aaccactaca cgcagaagag cctctccctgl80 
tctccgggta aatgagtgcg acggccggoa agcccccgct ccccgggctc tcgcggtcgc240 
acgaggatgc ttggcacgta ccccgtgtac atacttccca ggcacccagc atggaaataa300 
agcacccagc gctgccctgg ggcccctgcg aaaaaaaaga aaaagaatcg aaaagggg 358 



(2) INFORMATION UBER SEQ ID NO: 25: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 89 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTH ETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 
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(A) BIBLIOTHEK: cDNA library 
(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 25 

gcccctagcc cctggcagac atagctgctt cagtgcccct tttcctctgc tggctagatg60 
gatgttgatg cactggaggt acttttagc 89 

(2) INFORMATION UBER SEQ ID NO: 26: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1632 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BjBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 26 

gacactggtt ggttctgata agaggcaggg gaggagaaag ccgaggaaga gggagttgcg 60 
gaagaggagg gagttaacaa gttctcttat ccaccatcac accgggagtg ttgtccagcc 120 
gtggaggagg aggacgatga agaagctgta aagaaagaag ctcacagaac ctctacttct 180 
gccttgrtctc caggatccaa gcccagcact tgggtgtctt gcccagggga ggaagagaat 240 
caagccacgg aggataaaag aacagaaaga agtaaaggag ccaggaagac ctccgtgtcc 300 
ccccgatctt caggctccga ccccaggtcc tgggagtatc gttcaggaga ggcgtccgag 360 
gagaaggagg aaaaggcaca caaagaaact gggaaaggag aagctgcccc agggccgcaa 420 
tcctcagccc cagcccagag gccccagctc aagtcctggt ggtgccaacc cagtgatgaa 480 
gaggagggtg aggtcaaggc tttgggggca gctgagaagg atggagaagc tgagtgtcct 540 
ccctgcatcc ccccaccaag tgccttcctg aaggcctggg tgtattggcc aggagaggac 600 
acagaggaag aggaagatga ggaagaagat gaggacagtg actctggatc agatgaggaa 660 
gagggagaag ctgaggcttc ctcttccact cctgctacag gtgtcttctt gaagtcctgg 720 
gtctatcagc caggagagga cacacagtga tacaggatca gccgaggatg aaagagaagc 780 
tgagacttct gcttccacac cccctgcaag tgctttcttg aaggcctggg tgtatcggcc 840 
aggagaggac actggatagt- gaggataagg aagatgattc agaagcagcc ttaggagaag 900 
ctgagtcaga cccacatccc tcccacccgg accagagggc ccacttcagg ggctggggat 960 
atcgacctgg aaaagagaca gaggaagagg aagctgctga ggactgggga gaagctgagcl020 
cctgcccctt ccgagtggcc atctatgtac ctggagagaa gccaccgcct ccctgggctcl080 
ctcctaggct gcccctccga ctgcaaaggc ggctcaagcg cccagaaacc cctactcatgll40 
atccggaccc tgagactccc ctaaaggcca gaaaggtgcg cttctccgag aaggtcactgl200 
tccatttcct ggctgtctgg gcagggccgg cccaggccgc ccgccagggc ccctgggagc!260 
agcttgctcg ggatcgcagc cgcttcgcac gccgcatcac ccaggcccag gaggagctgal320 
gcccctgcct cacccctgct gcccgggcca gagcctgggc acgcctcagg aacccaccttl380 
tagcccccat ccctgccctc acccagacct tgccttcctc ctctgtccct tcgtccccagl440 
tccagaccac gcccttgagc caagctgtgg ccacaccttc ccgctcgtct gctgctgcaglSOO 
cggctgccct ggacctcagt gggaggcgtg gctgagacca actggtttgc ctataatttal560 
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ttaactattt attttttcta agtgtgggtt tatetaagga ataaagcctt ttgatttgtal6Z0 
acgaaaaaaa aa 1632 



(2) INFORMATION UBER SEQ ID NO: 27: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2972 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 27 

ccaggacgag cacctcatta cattcttcgt gcctgtcttt gagccgctgc cccctcagta 60 

cttcatccga gtggtgtctg accgctggct ctcttgtgag acccagctgc ctgtctcctt 120 

ccggcacctg atcttgccgg agaagtaccc ccctccaacc gaacttttgg acctgcagcc 180 

cttgcccgtg tctgctctga gaaacagtgc ctttgagagt ctttaccaag ataaatttcc 240 

tttcttcaat cccatccaga cccaggtgtt taacactgta tacaacagtg acgacaacgt 300 

gtttgtgggg gcccccacgg gcagcgggaa gactatttgt gcagagtttg ccatcctgcg 360 

aatgctgctg cagagctcgg aggggcgctg tgtgtacatc acccccatgg aggccctggc 420 

agagcaggta tacatggact ggtacgagaa gttccaggac aggctcaaca agaaggtggt 480 

actcctgaca ggcgagacca gcacagacct gaagctgctg ggcaaaggga acattatcat 540 

cagcacccct gagaagtggg acatactttc ccggcgatgg aagcagcgca agaacgtgca 600 

gaacatcaac ctcttcgtgg tggatgaggt ccaccttatc gggggcgaga atgggcctgt 660 

cttagaagtg atctgctccc gaatgcgcta catctcctcc cagattgagc ggcccattcg 720 

cattgtggca ctcagctctt cgctctccaa tgccaaggat gtggcccact ggctgggctg 780 

cagtgccacc tccaccttca acttccatcc caatgtgcgt cccgtcccct tggagctgca 840 

catccagggc ttcaacatca gccatacaca aacccgcctg ctctccatgg ccaagcctgt 900 

gtaccatgct atcaccaagc actcgcccaa gaagcctgtc attgtctttg tgccgtctcg 960 

caagcagacc cgcctcactg ccattgacat cctcaccacc tgtgcagcag acatccaacgl020 

gcagaggttc ttgcactgca ccgagaagga tctgattccg tacctggaga agctaagtgal080 

cagcacgctc aaggaaacgc tgctaaatgg ggtgggctac ctgcatgagg ggctcagcccll40 

catggagcga cgcctggtgg agcagctctt cagctcaggg gctatccagg tggtggtggcl200 

ttctcggagt ctctgctggg gcatgaacgt ggctgcccac ctggtaatca tcatggatacl260 

ccagtactac aatggcaaga tccacgccta tgtggattac cccatctatg acgtgcttcal320 

gatggtgggc cacgccaacc gccctttgca ggacgatgag gggcgctgtg tcatcatgtgl380 

tcagggctcc aagaaggatt tcttcaagaa gttcttatat gagccattgc cagtagaatcl440 

tcacctggac cactgtatgc atgaccactt caatgctgag atcgtcacca agaccattgalSOO 

gaacaagcag gatgctgtgg actacctcac ctggaccttt ctgtaccgcc gcatgacacal560 

gaaccccaat tactacaacc tgcagggcat ctcccatcgt cacttgtcgg accacttgtcl620 

agagctggtg gagcagaccc tgagtgacct ggagcagtcc aagtgcatca gcatcgaggal680 

cgagatggac gtggcgcctc tgaacctagg catgatcgcc gcctactatt acatcaactal740 

caccaccatt gagctcttca gcatgtccct caatgccaag accaaggtgc gagggcttat!800 



81 



• DE 198 18 619 A 1 



cgagatcatc tccaatgcag cagagtatga gaacat+iccc atccggcacc atgaagaCaal860 
tctcctgagg cagttggctc agaaggtccc ccacEcgctg aataacccta agttcaal;gal320 
tccgcacgtc aagaccaacc tgctcctgca ggctcacttg tctcgcatgc agctgagtgcl980 
tgagttgcag tcagatacgg aggaaatcct tagtaaggca atccggctca tccaggcctg2040 
cgtggatgtc ctttccagca atgggtggct cagccctgct ctggcagcta tggaactggc2100 
ccagatggtc acccaagcca tgtggtccaa ggactcatac ctgaagcagc tgccacactt2160 
cacctctgag catatcaaac gttgcacaga caagggagtg gagagtgttt tcgacatcat2220 
ggagatggag gatgaagaac ggaacgcgtt gcttcagctg actgacagcc agattgcaga2280 
tgtggctcgc ttttgtaacc gctaccctaa tatcgaacta tcttatgagg tggtagataa2340 
ggacagcatc cgcagtggcg ggccagttgt ggtgctggtg cagctggagc gagaggagga2400 
agtcacaggc cctgtcattg cgcctctctt cccgcagaaa cgtgaagagg gctggtgggt2460 
ggtgattgga gatgccaagt ccaatagcct catctccatc aagaggctga ccttgcagca2520 
gaaggccaag gtgaagttgg actttgtggc cccagccact ggtgcccaca actacactct2580 
gtacttcatg agtgacgctt acatgggatg tgaccaggag tacaaattca gcgtggatgt2640 
gaaagaagct gagacagaca gtgattcaga ttgagtcctg aggcatttac ttttgggtaa2700 
aggagagttg agcctgaatt aggaatgtgt acattgtagg aatcctggtt gtggggacca2760 
ggtctgtggg cctcaggtct ggccagccag ggctggtgct gtccccgcct acctccactt2820 
cctttccctt gctcactctg gatccagtga cagcaggtgt catgggtcaa gcataaatca2880 
tatatagcat tttcaggcat gttcctggta gttcttttga gtctgacatt ctaataaaat2940 
aatttgtaga aaaaaaacca aaaaaaaaaa aa 2972 

(2) INFORMATION UBER SEQ ID NO: 28: 

(i) SEQUENZ CHARAKTER1STIK: 

(A) LANGE: 496 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 



(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 28 



ctcgagccga agagtcctgg acctcctgtg 
tctcctggtg gcagctccca 1 gatgggtcct 
aggactggtg aagccttcgg agaccctgtc 
cagtagttac tactggagct ggatccggca 
gtatatct-at tacagtggga gcaccaacta 
atcagtagac acgtccaaga accagttctc 
cacggccgtg tattactgtg cgagacaggg 
ccagggaacc ctggtcaccg tctcctgagc 
ggcaccctgc tccaag 



caagaacatg aaacatctgt ggttcttcct 60 
gtcccaggtg cagctgcagg agtcgggcccl20 
cctcacctgc actgtctctg gtggctccatl80 
gcccccaggg aagggactgg agtggattgg240 
caacccctcc ctcaagagtc gagtcaccat300 
cctgaagctg agctctgtga ccgctgcgga360 
tatagcagtg gaccagcttg actactgggg420 
ctgcaccaag gggccatcgg tcttccccct480 

496 



(2) INFORMATION UBER SEQ ID NO: 29: 
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(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 397 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOlTYP: aus einzelnen ESTs durch Assemblierung und Editlerung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 29 

gaggtcctgg acctcctgtg caagaacatg aaacacctgt ggttcttcct cctcctggtg 60 
gcagctccca gatgggtcct gtcccaggtg- cagctgcagg agtcgggccc aggactggtgl20 
aagccttcgg agaccctgtc cctcacctgc actgtctctg gtggctccat cagtagttacl80 
tactggagct ggatccggca gcccgccggg aagggactgg agtggattgg gcgtatctat240 
accagtggga gcaccaacta caacccctcc. ctcaagagtc gagtcaccat gtcagtagac300 
acgtccaaga accagttctc cctgaagctg agctctgtga ccgccgcgga cacggccgtg360 
tattactgtg cgagagcaaa acgcagctgg acctcag 397 



(2) INFORMATION UBER SEQ ID NO: 30: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 772 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NOf 3(5 

gggactcctc agttcacctt ctcacaatga ggctccctgc tcagctcctg gggctgctaa 60 
tgctctgggt ctctggatcc agtggggata ttgtgatgac tcagtctcca ctctccctgcl20 
ccgtcacccc tggagagccg gcctccatct cctgcaggtc tagtcagagc ctcctgcatal80 
gtaatggata caactatttg gattggtacc tgcagaagcc agggcagtct ccacagctcc240 
tgatctattt gggttctaat cgggcctccg gggtccctga caggttcagt ggcagtggat300 
caggcacaga ttttacactg aaaatcagca gagtggaggc tgaggatgtt ggggtttatt360 
actgcatgca agctctacaa actcctctca ctttcggcgg agggaccaag gtggagatca420 
aacgaactgt ggctgcacca tctgtcttca tcttcccgcc atctgatgag cagttgaaat480 
ctggaactgc ctctgttgtg tgcctgctga ataacttcta tcccagagag gccaaagtac540 
agtggaaggt ggataacgcc ctccaatcgg gtaactccca ggagagtgtc acagagcagg600 
acagcaagga cagcacctac agcctcagca gcaccctgac gctgagcaaa gcagactacg660 
agaaacacaa agtctacgcc tgcgaagtca cccatcaggg cctgagcttg gcccgttaag720 
aaagggcttt caacaggggg aagtttttag aggggagatg tggccccacc tt 772 



(2) INFORMATION UBER SEQ ID NO: 31: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1031 Basenpaare 

(B) TYP: Nuklelnsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 31 



ggggacggaa gcgcagagca cggaccccgc 
ggggcgccgg cctccgcccg gccccgaggg 
ccccatgggc aactgccagg cagggcacaa 
tctggtctgt gccactttga tgctgctgct 
cttcctcctc acccacagga ctggcctgcg 
tttttgagat cttttggcca gctgaccctg 
tggcgagggt ctcacgtcgt gggcttgctg 
aggaacaaga cccggacatt ccaggccaca 
tcttctgcag gacaactggt ccttatcaca 
acctgcctat attttagtgc tgttccagga 
tgctcagagg agggggctgg aaatgccacc 
agtgtctgga gccatgaagg ccttgtgctg 
ctgtgtggct ccaggctgct ggtcttgggc 
tgctgtgtca ctgctatgtg cttccacccg 
ctctgagggc actggcctag ttcccgactt 



cccctcgcgg ccccgctcgt gacgtcgcgg 60 
gqtctccccg gaggctcagc cccctctgct 120 
cctgcacctg tgtctggccc accacccacc 180 
ccttggcctc tctggcctgg gccttggcag 240 
cagcctgaca tcccccagga ctgggtctct 300 
tgtcccagga atgggacagt cacagggaag 360 
accaccttga acttcggaga cggtccagac 420 
gtcctgggaa gtcagatggg attgaaagga 480 
gccagggtga ccacagaaag gactgcagga 540 
atcctaccct ccagccagcc acccatatcc 600 
ctgagcccta gaatgggtga ggaatgtgtt 660 
accaagctgc tcacctcgga ggagctggct 720 
tccttcctgc ttctcttctg tggccttctc 780 
cgccgggagt cccactggtc tagaacccgg 840 
gtttctcagg tgtgaatcaa cttcttgggc 900 
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cttggctctg agttggaaaa ggttttagaa aaactgaaga . gctogaatgt gg.jggaaaat 960 
aaaaaqcttt tttgcccaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaal020 

1031 

aaaaaaaaaa a 

(2) INFORMATION UBER SEQ ID NO: 32: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 739 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIB linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vl) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii> SONSTIGE HERKUNFT: 
(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 32 

cggctcgagc cccgctcagt cacccgcagc aggcgtgcag tttcccggct ctccgcgcgg 60 
ccggggaagg tcagcgccgt aatggcgttc ttggcgtcgg gaccctacct gacccatcagl20 
caaaaggtgt tgcggcttta taagcgggcg ctacgccacc tcgagtcgtg gtgcgtccagl80 
agagacaaat accgatactt tgcttgtttg atgagagccc ggtttgaaga acataagaat240 
gaaaaggata tggcgaaggc cacccagctg ctgaaggagg ccgaggaaga attctggtac300 
cgtcagcatc cacagccata catcttccct gactctcctg ggggcacctc ctatgagaga360 
tacgattgct acaaggtccc agaatggtgc ttagatgact ggcatccttc tgagaaggca420 
atgtatcctg attactttgc caagagagaa cagtggaaga aactgcggag ggaaagctgg480 
gaacgagagg ttaagcagct gcaggaggaa acgccacctg gtggtccttt aactgaagct540 
ttgccccctg cccgaaagga aggtgatttg cccccactgt ggtggtatat tgtgaccaga600 
ccccgggagc ggcccatgta gaaagagaga gacctcatct ttcatgcttg caagtgaaat660 
atgttacaga acatgcactt gccctaataa aaaatcagtg aaatggaaaa aaaaaaaaaa720 
aaaaaaaaaa aaaaaaaaa 



(2) INFORMATION UBER SEQ ID NO: 33: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 651 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 



85 



• DE 198 18 619 A 1 



(Hi) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 33 

cggctcgagc ctcagttcac cttctcacca tgaggctccc tgctcagctc ctggggctgc 60 
taatgctctg ggtccctgga tccagtgagg atattgtgat gacccagact ccactctcccl20 
tgcccgtcac ccctggagag ccggcctcca tctcctgcag gtctagtcag agcctcttggl80 
atagtgatga tggaaacacc tatttggact ggtacctgca gaagccaggg cagtctccac240 
agctcctgat ctatacgctt tcctatcggg cctctggagt cccagacagg ttcagtggca300 
gtgggtcagg cactgatttc acactgaaaa tcagcagggt ggaggctgag gatgttggag360 
tttattactg catgcaacgt atagaatttc cttacacttt tggccagggg accaagctgg420 
agatcaaacg aactgtggct gcaccatctg tcttcatctt cccgccatct ggatgagcag480 
ttgaaatctg gaacttgcct ctgttgttgt gcctgcttga ataactttct attcccagag540 
aggggcaaag taacagtgga aggttggatt aacgccctgc aattcgggta actgcccagg600 
gagtagtttt cacagggcag gggcagcaag gacagcacct acagtcttag t 651 

(2) INFORMATION UBER SEQ ID NO: 34: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 823 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergesteltte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 34 

ccgcgtcgac aaattttttt aaagatcatc gatgaagaga gaaaatgcgc ttttctacag 60 
agtccccttc ccacccacag ccccatcccc agataagcgg ggagttccct ggcgcggtgcl20 
cagtttctag ccgctgagtg ggcgtgtgcg cggctccaag tgcgcctgcg tactgctcaol80 
tccccagctc cgcgccctgc tccgttcotc ccaaaactct gaatcgaaga actttccgga240 
agtttctgag agcccagacc ggcgggcacg cgcccatccc caaccccctc tgttaatccc300 
taccagcctg cagtcctggc tgcttccaag caggaggtgg ggcctctggc ctagcggggc360 
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cgaaaggcag tgcccctccc ccgcagtctg 
aggagcggca . ggacgagcat ggctacatct 
cccccggtgt ggaccccacc caagtttcct 
tggaggcccc catgcccaag ctagccacgc 
tcgagtcgcg ggcccagctt gggggcccag 
agtaaagcct tagcccggat. gcccacccct 
acctgtgtgt tcttttgata catttatctt 
cacctggtca aaaaaaaaaa aaaaaaaaaa 



atttccotct- tccccccaac ggcaagcacg!20 

cccggtgctt cacgcggaaa tacacgctgc480 

cctccctgtc ccctgagggc acactgaccg540 

agtccaacga gatcaccatc ccagtcacct600 

aagctgcaaa atccgatgag actgccgcca660 

gctgccgcca ctggctgtgc ctcccccgcc720 

ctgtttttct caaataaagt tcaaagcaac780 

aaaaaaaaaa aaa 823 



(2) INFORMATION UBER SEQ ID NO: 35: 

(j) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 457 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Edfoerung 
hergesteltte partielle cDNA 



(Hi) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 35 

cataagggaa atgctttctg agagtcatgg atctcatgtg caagaaaatg aagcacctgt 60 
ggttcttcct cctgctggtg gcggctccca gatgggtcct gtcccagctg cagctgcaggl20 
agtcgggccc aggactggtg aagccttcgg agaccctgtc cctcacctgc actgtctctgl80 
gtggctccat cagcagtagt agttactact ggggctggat ccgccagccc ccagggaagg240 
ggctggagtg gattgggagt atctattata gtgggagcac ctactacaac ccgtccctca300 
agagtcgagt caccatatcc gtagacacgt ccaagaagta cttctccctg aagctgagct360 
ctgtgaccgc cgcagacacg gctgtgtatt actgtgcgag acatgactgg tattacgata420 
ttttgactgg ttatgcgaaa cccggcacag gttcgac 457 

(2) INFORMATION UBER SEQ ID NO: 36: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1203 Basenpaare 

(B) TYP: Nukleinsaure 

(C) "STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Edltierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 
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(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNlA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 36 

gtcgggggcg cctgcgcagt. cgctcttcct caggcggcgg ccatggcggg acaggaggat 60 
ccggtgcagc gggagattca ccaggactgg gctaaccggg agtacattga gataatcacc 120 
agcagcatca agaaaatcgc agactttctc aactcgttcg atatgtcttg tcgttcaaga 180 
cttgcaacac taaacgagaa attgacagcc cttgaacgga gaatagagta cattgaagct 240 
cgggtgacaa aaggtgagac. actcacctag aacagtgccg tgctgctgct gggaagttgc 300 
tttacacaac acaggccaca tgggaaaggc cccagcagcc ttcagctcct tcctttctcc 360 
ttaaagagca acagggctta ttcttgtttt tcttttttca aaagtgtggc ctttgggctc 420 
tgccatctgg ggtgtggtgt ggtatgtggg aagaagttca gaggaaccgt tggaaacgac 480 
gttaggcatt ttaccttttc agtaacattt tatacatcta cttgtcaatg tatttgagac 540 
attcacagcc aaaagcctgg gactctttgt gaaggtcctc ctcacctcta tctttctttc 600 
tctctctctc aaactttcct taaagttctc attgcctttg cactgcttct gtgaacagtc 660 
tttgtctcct ccccaccttt ggtgggaagt gcggggcagt cctggtcaag acactcatgc 720 
cctggcaatg.tggctgccag agaatgttgt tgctaaccca ccagtttctt gttgatttgg 780 
agaggtcaag gccaggcccc cacttggctt gaagggacat tttcagactt ttctttctgt 840 
cacttggagt gtctatgcct ctcatatttc cctaataaac tcctcaactt tttatctgac 900 
tgctgtgatt atggtgggga gaggagctag agatgggttc acttattgca cagaaatgta 960 
atacatggcg ttattattct aacataaaac tttcagatgt agctgtttga ttcaaagcctl020 
aggtgcttac cagcccaagt ccccatgttt ggactttcag ctgactagct catcttgggal080 
atcatttggt cattcagcac atttaccaag tatttactat gtaggcatgt taaactccaall40 
taaaacatac agcattgaat cagaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaal200 



(2) INFORMATION UBER SEQ ID NO: 37: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 207 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHET1SCH: NEIN 

(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 



(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 



• DE 198 18 619 Al 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 37 

cggctcgagg ccgccctcgg tgtcagtgtc cccaggacag acggccagga tcacctgctc 60 
tggacatgca ttgccaaagc aatgatgctt attggtacca gtcagaggcc agggccaggcl20 
ccctgtgctt ggtggatccc ttgaaagaac attggaggag. ggcccttcag ggcatgccctl80 
ggagacggat tgctctgggc ttccaac 207 

(2) INFORMATION UBER SEQ ID NO: 38: 

(i) SEQUENZ CHARAKTERIST1K: 

(A) LANGE: 346 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte parh'elle cDNA 

(HI) HYPOTHETISCH: NEIN 
(III) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONST1GE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 38 

cggctcgagc ggctcgaggc cgggatggtg ggtgctacgc cccttgggta ctggggccag 60 
ggaaccctgg tcaccgtctc ctcagcctcc accaagggcc catcgggctt cccccgggcal20 
ccctcctcca agagcacctc tgggggcaca gcggccctgg gctgcctggt caaggactacl80 
ttccccgaac cggtgacggt gtcgtggaac tcaggcgcct gaccagcggc gtgcacacct240 
tcccggctgt ctacagtctc aggactctac tcctcagcag cgtggtgacg tgccctccag300 
cagttgggca ccagacctac atctgcaagt gaatcgaagc cagcaa 346 



(2) INFORMATION UBER SEQ ID NO: 39: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 926 Basenpaare 

(B) TYP: Nukleinsiure 

(C) STRANG: einzel 

(D) JOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
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(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONST1GE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 39 



cggctctaag gaagcagcac tggtggtgcc tcagccatgg cctggaccgt tctcctcctc €0 
ggcctcctct ctcactgcac aggctctgtg acctcctatg tgctgactca gccaccctcgl20 
gtgtcagtgg ccccaggaca gacggccagg attacctgtg ggggaaacaa cattggaagtlSO 
aaaagtgtgc actggtacca gcagaagcca ggccaggccc ctgtgctggt cgtctatgat240 
gatagcgacc ggccctcagg gatccctgag cgattctctg gctccaactc tgggaacacg300 
gccaccctga ccatcagcag ggtcgaagcc ggggatgagg ccgactatta ctgtcaggtg360 
tgggatagta .gtagtgatca ttgggtgttc ggcggaggga ccaagctgac cgtcctaggt420 
cagcccaagg. ctgccccctc ggtcactctg ttcccgccct cctctgagga gcttcaagcc480 
aacaaggcca cactggtgtg tctcataagt gacttctacc cgggagccgt gacagtggcc540 
tggaaggcag atagcagccc cgtcaaggcg ggagtggaga ccaccacacc ctccaaacaa600 
agcaacaaca agtacgcggc cagcagctat ctgagcctga cgcctgagca gtggaagtcc660 
cacagaagct acagctgcca ggtcacgcat gaagggagca ccgtggagaa gacagtggcc720 
cctacagaat gttcataggt tctcaaccct caccccccac cacgggagac tagagctgca780 
ggatcccagg ggaggggtct ctcqtcpcac cccaaggcat caagcccttc tccctgcact840 
caataaaccc tcaataaata ttctcattgt caatcaggaa aaaaaaaaaa aaaaaaaaaa900 
aaaaaaaaaa aaaaaaaaaa aaaaaa 926 

(2) INFORMATION UBER SEQ ID NO: 40: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2384 Basenpaare 

(B) TYP: Nuklelnsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(il) MOLEKULTYP: aus einzelnen ESTs durch Assembllerung und Editierung 
hergestellte partielle cDNA 

(HI) HYPOTHETISCH: NEIN 



(Hi) ANTI-SENSE: NEIN * 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 40 

gcctcccgcc cgccgcctct gtctccctct ctccacaaac tgcccaggag tgagtagctg 60 
ctttcggtcc gccggacaca ccggacagat agacgtgcgg acggcccacc accccagccc 120 
gccaactagt cagcctgcgc ctggcgcctc ccctctccag gtccatccgc catgtggccc 180 
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ctgtggcgcc tcgtgfcctct gctggccctg agccaggccc tgccctjttCja g<:agp.gaggc 2 40 
ttctgggact tcaccctgga cgatgggcca ttcatgcitga scgat^a'gga "agcttcgggc 300 
gctgacacct cgggcgtcct ggacccggac tctgtcacac ccacctacag cgccatgtgt 360 
cctttcggct gccactgcca cctgcgggtg gttcagtgct ccgacctggg tctgaagtct 420 
gtgcccaaag agatctcccc tgacaccacg ctgctggacc tgcagaacaa cgacatctcc 480 
gagctccgca aggatgactt caagggtctc cagcacctct acgccctcgt cctggtgaac 540 
aacaagatct ccaagatcca tgagaaggcc ttcagcccac tgcggaagct gcagaagctc 600 
tacatctcca agaaccacct ggtggagatc ccgcccaacc tacccagctc cctggtggag 660 
ctccgcatcc acgacaaccg catccgcaag gtgcccaagg gagtgttcag tgggctccgg 720 
aacatgaact gcatcgagat gggcgggaac ccactggaga acagtggctt tgaacctgga 780 
gccttcgatg gcctgaagct caactacctg cgcatctcag aggccaagct gactggcatc 840 
cccaaagacc tccctgagac cctgaatgaa ctccacctag accacaacaa aatccaggcc 900 
atcgaactgg aggacctgct tcgctactcc aagctgtaca ggctgggcct aggccacaac 960 
cagatcagga tgatcgagaa cgggagcctg agcttcctgc ccaccctccg ggagctccacl020 
ttggacaaca acaagttggc cagggtgccc tcagggctcc cagacctcaa gctcctccagl080 
gtggtctatc tgcactccaa caacatcacc aaagtgggtg tcaacgactt ctgtcccatgl!40 
ggcttcgggg tgaagcgggc ctactacaac ggcatcagcc tcttcaacaa ccccgtgcccl200 
tactgggagg tgcagccggc cactttccgc tgcgtcactg actgcctggc catccagtttl260 
ggcaactaca aaaagtagag gcagctgcag ccaccgcggg gcctcagtgg gggtctctggl320 
ggaacacagc cagacatcct gatggggagg cagagccagg aagctaagcc agggcccagc!380 
tgcgtccaac ccagcccccc acctcgggtc cctgacccca gctcgatgcc ccatcaccgcl440 
ctctccctgg ctcccaaggg tgcaggtggg cgcaaggccc ggcccccatc acatgttcccl500 
ttggcctcag agctgcccct gctctcccac cacagccacc cagaggcacc ccatgaagctl560 
tttttctcgt tcactcccaa acccaagtgt ccaaggctcc agtcctagga gaacagtcccl620 
tgggtcagca gccaggaggc. ggtccataag aatggggaca gtgggctctg ccagggctgcl680 
cgcacctgtc cagacacaca tgttctgttc ctcctcctca tgcatttcca gcctttcaacl740 
cctccccgac tctgcggctc ccctcagccc ccttgcaagt tcatggcctg tccctcccagl800 
acccctgctc cactggccct tcgaccagtc ctcccttctg ttctctcttt ccccgtccttl860 
cctctctctc tctctctctc tttctgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtgl920 
tgtgtgtgtc ttgtgcttcc tcagaccttt ctcgcttctg agcttggtgg cctgttccctl980 
ccatctqtcc gaacctggct tcgcctgtcc ctttcactcc acaccctctg gccttctgcc2040 
ttgagctggg actgctttct gtctgtccgg cctgcaccca gcccctgccc acaaaacccc2100 
agggacagcg gtctccccag cctgccctgc tcaggccttg cccccaaacc tgtactgtcc2160 
cggaggaggt tgggaggtgg aggcccagca tcccgcgcag atgacaccat caaccgccag2220 
agtcccagac accggttttc ctagaagccc ctcaccccca ctggcccact ggtggctagg2280 
tctcccctta tccttctggt ccagcgcaag gaggggctgc ttctgaggtc ggtggctgtc2340 
tttccattaa agaaacaccg tgcaacgtga aaaaaaaaaa aaaa . 2384 



(2) INFORMATION UBER SEQ ID NO: 41 : 

(I) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 334 Basenpaare 

(B) TYP: NukleinsSure 

(C) STRANG: elnzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assembllerung und Edltierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vl) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 



91 



# DE 198 18 619 A 1 



(A) BIBUOTHEK: cDNA library 
(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 41 

ctcgagccga attcggctcg agaggagccc agccctggga ttttcaggtg ttttcatttg 60 
gtgatcagga ctgaacagag agaactcacc atggagtttg ggctgagctg gctttttcttl20 
gtggctattt taaaaggtgt ccagtgtgag gtgcagctgt tggagtctgg gggaggcttgl80 
gtacagcctg gggggtccct gagactctcc tgtgcagcct ctggattcac ctttagcagc240 
tatgccatga gctgggtccg ccaggctcca .gggaaggggc tggagtgggt ctcaggtatt300 
agtggtagtg gtgtgatagt acacactacg caga 334 

(2) INFORMATION UBER SEQ ID NO: 42: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 845 Basenpaare 

(B) TYP: NuWeinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 
(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 
(A) BIBLIOTHEK: cDNA library 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 42 

gcgttccctc cgccgagcta cttctttctt tccttttttt tttttttctg gctaacagaa 60 
ttttattgtt aaatcacaga aactttagtg caaaacaaaa atcacgaagt ccatttaatal20 
gcaacttcat gtcctgctgg ctttgcttgc tgtctcctgg caaccagaag tggacagaagl80 
cgtgggtgcc caagtgggcc acagacagct tccaaccccc acaccccagc atccaatcca240 
cacccagcag acccttcggc atgccgccct ctaccaggaa gccagaggcc taggagctcg300 
ccatccatat* ttatttgaaa aggtcaaaag gagcatctat gagacaaggg aggggtgcag360 
gctgaagcag cgcctcaaca gccagggaca tgtaggcaac acgagcaggc acagcgcggc420 
caeca ctgtc cacacgctca cacaagccag gcccgcaggg ectteggaga gctagcaggt480 
tacattcagg cagatggccc tcttcccacc caaacccaca gaaccccaaa caaggcatca540 
ccaggaaaga caegggaaag ccaaatcaca gttgaaccag ggacagagaa cccttggccc600 
cactgatgtc ccaagccacc ageagctget tccaaaatcc etatgetatt acagtgggaa660 
ttacatcatt taaaaagect gattattccc aggcttctaa tctttcatat aaaactgcct720 
ttgttttgct cctttgttca actcagaggc ccagcaaagc gggcagggtc cctgatcagg780 
gcaggagccc acctcagaag cccatgccgc accagtgccc aagcacatgt cagtgctcag840 
aacaa 845 



(2) INFORMATION UBER SEQ ID NO: 43: 
(i) SEQUENZ CHARAKTERISTIK: 
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(A) LANGE: 2233 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einze! 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editieaing 
hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 43 

gaattcagaa gttaatgatg ttgggtaaga gaacaatggt aagagagcaa tctaagaata 60 
tatcacctac tttaatttta tatgagagta catggaggta gctgtgatgt ggaaatgtag 120 
cactgctcct acccacgcag atttattcca gtgaaacaac aactggaact tcaagtaact 180 
cctcccagag tacttccaac tctgggttgg ccccaaatcc aactaatgcc accaccaagg 240 
cggctggtgg tgccctgcag tcaacagcca gtctcttcgt ggtctcactc tctcttctgc 300 
atctctactc ttaagagact caggccaaga aacgtcttct aaatttcccc atcttctaaa 360 
cccaatccaa atggcgtctg gaagtccaat gtggcaagga aaaacaggtc ttcatcgaat 420 
ctactaattc cacacctttt attgacacag aaaatgttga gaatcccaaa tttgattgat 480 
ttgaagaaca tgtgagaggt ttgactagat gatggatgcc aatattaaat ctgctggagt 540 
ttcatgtaca agatgaagga gaggcaacat ccaaaatagt taagacatga tttccttgaa 600 
tgtggcttga gaaatatgga cacttaatac taccttgaaa ataagaatag aaataaagga 660 
tgggattgtg gaatggagat tcagttttca tttggttcat taattctata aggccataaa 720 
acaggtaata taaaaagctt ccatgattct atttatatgt acatgagaag gaacttccag 780 
gtgttactgt aattcctcaa cgtattgttt cgacagcact aatttaatgc cgatatactc 840 
tagatgaagt tttacattgt tgagctattg ctgttctctt gggaactgaa ctcactttcc 900 
tcctgaggct ttggatttga cattgcattt gaccttttat gtagtaattg acatgtgcca 960 
gggcaatgat gaatgagaat ctacccccag atccaagcat cctgagcaac tcttgattatl020 
ccatattgag tcaaatggta ggcatttcct atcacctgtt tccattcaac aagagcactal080 
cattcattta gctaaacgga ttccaaagag tagaattgca ttgaccgcga ctaatttcaall40 
aatgcttttt attattatta ttttttagac agtctcactt tgtcgcccag gccggagtgcl200 
agtggtgcga tctcagatca gtgtaccatt tgcctcccgg gctcaagcga ttctcctgccl260 
tcagcctccc aagtagctgg gattacaggc acctgccacc atgcccggct aatttttgtal320 
attttagtag agacagggtt tcaccatgtt gcccaggctg gtttcgaact cctgacctcal380 
ggtgatccac ccgcctcggc ctcccaaagt gctgggatta caggcttgag cccccgcgccl440 
cagccatcaa aatgctttt ( t atttctgcat atgttgaata ctttttacaa tttaaaaaaalSOO 
tgatctgttt tgaaggcaaa attgcaaatc ttgaaattaa gaaggcaaaa atgtaaaggal560 
gtcaaaacta taaatcaagt atttgggaag tgaagactgg aagctaattt gcattaaattl620 
cacaaacttt tatactcttt ctgtatatac attttttttc tttaaaaaac aactatggatl680 
cagaatagcc acatttagaa cactttttgt tatcagtcaa tatttttaga tagttagaacl740 
ctggtcctaa gcctaaaagt gggcttgatt ctgcagtaaa tcttttacaa ctgcctcgacl800 
acacataaac ctttttaaaa atagacactc cccgaagtct tttgttcgca tggtcacacal860 
ctgatgctta gatgttccag taatctaata tggccacagt agtcttgatg accaaagtccl920 
tttttttcca tctttagaaa actacatggg aacaaacaga tcgaacagtt ttgaagctacl980 
tgtgtgtgtg aatgaacact cttgctttat tccagaatgc tgtacatcta ttttggattg2040 
tatattgtgt ttgtgtattt acgctttgat tcatagtaac ttcttatgga attgatttgc2100 
attgaacaca aactgtaaat aaaaagaaat. ggctgaaaga gcaaaaaaaa aggagggcag2160 
gagagaggaa aaggggagga agaggagggg ggaaagagaa gggagagaga aggaggggga2220 
aggaggtggg ggg 2233 
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(2) INFORMATION UBER SEQ ID NO: 44: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 243 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch AssembHerung und Edit'erung 

hergestellte partielle cDNA 



(Hi) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 44 

ggagcccagc actagaagtc ggcggtgttt ccattcggtg atcagcactg aacacagagg 60 
actcaccatg gagtttgggc tgagctgggt tttcctcgtt gctcttttaa gaggtgtccal20 
gtgtcaggtg cactggtgga gcggggagcg ggtcagcagg agtcctgaat cctgtgacgcl80 
tgatcagtoc tatatcagat ggcgcagctc agcagggtga tggggtatga atgataacat240 



(2) INFORMATION UBER SEQ ID NO: 45: 

(I) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 817 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch AssembHerung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 



(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vu) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 45 

gttttttttt tttttttttg aagagagcag attctcttta ttgagatacg ggacacagcg 60 
aagggtggag agacggaaca gccccccagc ctcagccctc tccacggggg ccggatgccal20 
gagatgggag aagggattca gtctctcgcc cgggaaaccc agtcccacag agggcgccggl80 
caagggtggg acgcgacctg ggtgacacgg tgcagggagt ctttaaatag aggaggggct240 
ggagcgggga aacgcgccgg ggccctagcg caccatgtat tccttgcgct tattgagccg300 
aacttggcag aaagagaagc ctccgaggag gaggtaaagg cctgcagcga tgaaacagtt360 
gtagctgact tgctcgtaaa ggttgtatat gttctggggg ccattctcaa aatctttctc420 
cgtgaaggga acgtcctcaa tcaacacagc ggaatggaca ttgaaaaata ttccgagcat480 
tatcaacatg atcactcccc aggcgctgag gacgatgccg caggcggcca gcttcggccc540 
acagcacagg agcgacgcca taaagaaggg agtcggggat cgccgaggtg caagcgggct600 
cggaaagcgg tgggagaaag cccaggatgc cctcgcaggg gggcagaggg ggcgtggccc660 
cggcctcaac catcccatcc gggggcggca ggcggaaaag gctgggctcc tctcaggact720 
ttcgcgggag acggcgccgt ctgaaaccaa. aactgctcct ggggaaacct tccttgacct780 
ctgtagctag ggcgtgagta ttggaagagc gagggcc 817 

(2) INFORMATION UBER SEQ ID NO: 46: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1644 Basenpaare 

(B) TYP: Nukleinsaure - 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte parh'elle cDNA 

(Hi) H YPOTH ETISCH : NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 46 

gttccggctc acatgggaaa tactttctga gagtcctgga cctcctgtgc aagaacatga 60 

aacacctgtg gttcttcctc ctgctggtgg cagctcccag atgggtcctg tcccaggtgc 120 

agctgcagga gtcgggccca ggactggtga agccttcaca gaccctgtcc ctcacctgca 180 

ctgtctctgg tggctccatc agcagtggtg gttactactg gagctggatc cgccagcacc 240 

cagggaaggg cctggagtgg attgggtaca tctattacag tgggagcacc tactacaacc 300 

cgtccctcaa gagtcgagtt accatatcag tagacacgtc taagaaccag ttctccctga 360 

agctgagctc tgtgactgcc gcggacacgg ccgtgtatta ctgtgcgaga gagcatctct 420 

cctacggtga ctcgagatac tactactacg gtatggacgt ctggggccaa gggacccggt 480 

caccgtctcc tcagcatccc cgaccagccc caaggtcttc ccgctgagcc tctgcagcac 540 

ccagccagat gggaacgtgg tcatcgcctg cctggtccag ggcttcttcc cccaggagcc 600 

actcagtgtg acctggagcg aaagggacag ggcgtgaccg ccagaaactt cccacccagc 660 

caggatgcct ccggggacct gtacaccacg agcagccagc tgaccctgcc ggccacacag 720 

tgcctagccg gcaagtccgt gacatgccac gtgaagcact acacgaatcc cagccaggat 780 

gtgactgtgc cctgcccagt tccctcaact ccacctaccc catctccctc aactccacct 840 

accccatctc cctcatgctg ccacccccga ctgtcactgc accgaccggc cctcgaggac 900 



95 



DE 198 18 619 A 1 

ctgctcttag gttcagaagc gaacctcacg tgcacactga ccijgcctgag agatgcctca 96.Q 
ggtgtcacct tcacctggac gccctcaagt gggaaga'Qcg cCgctca'agg accacctgagl020 
cgtgacctct gtggctgcta cagcijtgtcc agtgtcctgc cgggctgtgc cgagccatggl080 
aaccatggga agaccttcac ttgcactgct gcctaccccg agtccaagac cccgctaaccll40 
gccaccctct caaaatccgg aaacacattc cggcccgagg tccacctgct gccgccgccgl200 
tcggaggagc tggccctgaa cgagctggtg acgctgacgt gcctggcacg cggcttcagcl260 
cccaaggacg tgctggttcg ctggctgcag gggtcacagg agctgccccg cgagaagtacl320 
ctgacttggg catcccggca ggagcccagc cagggcacca ccaccttcgc tgtgaccagcl380 
atactgcgcg tggcagccga ggactggaag aagggggaca ccttctcctg catggtgggcl440 
cacgaggccc tgccgctggc cttcacacag aagaccatcg accgcttggc gggtaaaccclSOO 
acccatgtca atgtgtctgt tgtcatggcg gaggtggacg gcacctgcta ctgagccgccl560 
cgcctgtccc cacccctgaa taaactccat gctcccccaa gcaaaaaaaa aaaaaaaaaal620 
aaaaaaaaaa aaaaaaaaaa aaaa 1644 



(2) INFORMATION OBER SEQ ID NO: 47: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1 133 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: elnzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus elnzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 47 

cacatgccaa acagtggttc ttattaaatc 60 
aptgatgctt tgtgtatcta ttttcttctc 120 
aagacaatca ttgaatacaa aacaaataag 180 
cctttggaca tcggtggtgc tgaccaggaa 240 
aaataaatga actcaatcta aattaaaaaa 300 
catttcttct tcttcttttt taactgaaag 360 
cttgcttaaa ttgtgggcaa aagagaaaaa 420 
gtttcattaa ctccttcccc cgctccccca 480 
cacctgttat ggaaaatgtc aacctttgta 540 
aaccataaac atttgcacca cttgtggctt 600 
acccaaactt ccaaaggttt aaactacctc 660 
tgttaggtgc tgacctagac agagatgaac 720 
caaaggtgct aattaatagt atttcagata 780 
ttgagaagaa atactcctgt attgagttgt 840 
tttagcattc atattttcca tcttattccc 900 
tcttcttcag attcagcatt tgttctttgc 960 
cacagaagct. ttgtttcttg ggcaagcagal020 
gagatgttta aataaattgt gaaaaaaatgl080 
aaaaaaaaaa aaaaaagtcg acc 1133 



atttatctgg gacagacatc ttcagaatga 
aaaggttcag atattatcag attcagaaat 
tttaaacaga aaaagacaaa tgaatgggga 
ccatcacgcc tgcccttcct tgatattgca 
ttctttgtgg acattggccc agtctgtttc 
gaaagaaatt tgaaaaaact ttctctttgc 
ctgaatcctt ccatttcttc tgcacatcta 
gaaggattga tcagagcattf gtgcaataca 
aaaatttgaa tttttttttc aacactctta 
agaaaaccaa aataaaaatt gaaaaataaa 
ttgaatatdt tccacagagg gaagtttaaa 
aaaacacttt cccatgagtg tgatccacat 
tgaggtcctt gttttgtttt gttcataata 
cttgaagaat gttgatggtg ctagaagaat 
atcgtgtggt gtatttttta aaaaatttga 
aattaaaagt atgcagatta tttgcccaaa 
cagtctcatt ttcatcttct tccatggttc 
aaaattaaat tgtacctatt ttgtatatgt 
aaataaagca tgtttggttt tccaaaagaa 
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(2) INFORMATION UBER SEQ ID NO: 48: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 969 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblieruhg und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 48 

gaggaggagg gtgtatctcc tttcgtcgga ccgccccttg gcttctgcac tgatggtggg 60 

tggatgagta atgcatccag gaagcctgga ggcctgtggt ttccgcaccc gctgccacccl20 

ccgcccctag cgtggacatt tatcctctag cgctcaggcc ctgccgccat cgccgcagatl80 

ccagcgccca gagagacacc agagaaccca . ccatggcccc ctttgagccc ctggcttctg240 

gcatcctgtt gttgctgtgg ctgatagccc ccagcagggc ctgcacctgt gtcccacccc300 

acccacagac ggccttctgc aattccgacc tcgtcatcag ggccaagttc gtggggacac360 

cagaagtcaa ccagaccacc ttataccagc gttatgagat caagatgacc aagatgtata420 

aagggttcca agccttaggg gatgccgctg acatccggtt cgtctacacc cccgccatgg480 

agagtgtctg cggatacttc cacaggtccc acaaccgcag cgaggagttt ctcattgctg540 

gaaaactgca ggatggactc ttgcacatca ctacctgcag tttcgtggct ccctggaaca600 

gcctga.gctt agctcagcgc cggggcttca ccaagaccta cactgttggc tgtgaggaat660 

gcacagtgtt tccctgttta tccatcccct gcaaactgca gagtggcact cattgcttgt720 

ggacggacca gctcctccaa ggctctgaaa agggcttcca gtcccgtcac cttgcctgcc780 

tgcctcggga gccagggctg tgcacctggc agtccctgcg gtcccagata gcctgaatcc840 

tgcccggagt ggaagctgaa gcctgcacag tgtccaccct gttcccactc ccatctttct900 

tccggacaat gaaataaaga gttaccaccc agcaaaaaaa aaaaaaaaaa acaagtcgtc960 
gcgtgctgt 969 



(2) INFORMATION UBER SEQ ID NO: 49: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 617 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 
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(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

i 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 49 

cctacaccta ccctcccttt gggtttctat tcggaccgcg atgatttgct ttggaaggct 60 
taaccccctt cttccccaaa cttgcccccg gagaaccccc agccttacga ccctcctcctl20 
gaagatgcaa aaccagcttg ccggccgcgc tctcttccag gacatcaaga agccagctgal80 
agatgagtgg ggtaaaaccc cagacgccat gaaagctgcc atggccctgg agaaaaagct240 
gaaccagggc cttttggatc ttcatgccct gggttctgcc cgcacggacc cccatctctg300 
tgacttcctg gagactcact tcctagatga ggaagtgaag cttatcaaga agatgggtga360 
ccacctgacc aacctccaca ggctgggtgg cccggaggct gggctgggcg agtatctctt420 
cgaaaggctc actctcaagc acgactaaga gccttctgag cccagcgact tctgaagggc480 
cccttgcaaa gtaatagggc ttctgcctaa gcctctccct ccagccaata ggcagctttc540 
ttaactatcc taacaagcct tggaccaaat ggaaataaag ctttttgatg caaaaaaaga600 
ggagggggga aaaaagc 617 

(2) INFORMATION UBER SEQ ID NO: 50: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 704 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 50 

ggggagactc gtcaccaggc gtgcagtggg cactgctggg ctcccccatc ccgtcctaac 60 
ccggaacagc cccgggcagg aggcgtggaa agtcgagggg gtaaaccgcg aatgtgcgttl20 
gtgtaagcca cggcgcaggg tggggcgcgg gcgggacttg ggcgggcggg gtgggcttggl80 
ccgagctggc ctccggggca ccgaccgcta taaggccagt cggactgcga cacagcccat240 
cccctcgacc gctcgcgtcg catttggccg cctccctacc gctccaagcc cagccctcag300 
ccatggcatg ccccctggat caggccattg gcctcctcgt ggccatcttc cacaagtact360 
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ccggcaggga gggtgacaag cacaccctga gcaagaagga gcfcgaaggag ctgatccaga42C 
aggagctcac cattggctcg aagctgcagg atgctgaaat tgcaaggctg atggaagact480 
tggaccggaa caaggaccag gaggtgaact tccaggagta tgtcaccttc ctgggggcct540 
tggctttgat ctacaatgaa gccctcaagg gctgaaaata aatagggaag atggagacac600 
cctctggggg tcctctctga gtcaaatcca gtggtgggta attgtacaat aaattttttt660 
tggtcaaatt taaaaaaaaa aaaaaaagag aaaaaagggt gage 704 

(2) INFORMATION OBER SEQ ID NO: 51: 

(A) LANGE: 95 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 51: 

PCSSQFHPVE NRSQEPIAGD SMSPRTLPVQ NMNNAMFLQK TLSLSFIGGN HQTTAECRTL60 
SRTTDLSPSH SPYHHKSHNK KEKRYFGFKK SKKIM 95 

(2) INFORMATION OBER SEQ ID NO: 52: 

(A) LANGE: 76 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 52: 

LPRDTWFKLK CLTDHSRHVL £SYVNVSHLT WVHCLQTEHR LPLAWFENRN RAMPTDPSYV60 
WASKWNCTFI QIFTCL 76 

(2) INFORMATION OBER SEQ ID NO: 53: 

(A) LANGE: 90 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 



9? 



# DE 198 18 619 A 1 # 

(iii) HYPOTHETISCH: ja 



5 (vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 53: 

RVNNCQEQLV VLKYNPQPRI PPVLQMDQLK QANTEDTKNE VRFIETRVTP LDELNTKMTL6 0 
TLSRYRSSET CLQNEIPEEF CSYPEIRGSN 90 

15 (2) INFORMATION UBER SEQ ID NO: 54: 

(A) LANGE: 117 Aminosauren 
„ (B)TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

25 (ii) MOLEKULTYP: ORF 



(iii) HYPOTHETISCH: ja 



30 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 

35 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 54: 

PLQPPRAMAP RGCIVAVFAI FCISRLLCSH GAPVAPMTPY LMLCQPHKRC GDKFYDPLQH 60 
CCYDDAWPL ARTQTCGNCT FRVCFEQCCP WTFMVKLINQ NCDSARTSDD RLCRSVS 117 

40 

(2) INFORMATION UBER SEQ ID NO: 55: 

(A) LANGE: 103 Aminosauren 
45 (B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

50 (ii) MOLEKULTYP: ORF 
(iii) HYPOTHETISCH: ja 

55 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 

60 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 55: 

RVPQPALGWC RVDVGHRGHQ EGSESLGPHQ HTHLMLSRIL EGDLWASSGQ RQGGPQTGHR 60 
MKWAVECVFL WPPNSHSASQ ISGNTSLFLQ AHPGRRIQES SFP 103 

65 

(2) INFORMATION OBER SEQ ID NO: 56: 
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(A) LANGE: 81 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 56: 

RCSFHTSGSW PRARRHHHSN SAAGGRRTCP HISCVAGTAS GKESWGPLGL RVSRGAWRCR60 
KWQRQLRCSL GEPWLWWAV E 81 

(2) INFORMATION OBER SEQ ID NO: 57: 

(A) LANGE: 125 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 57: 

RAARADSARA FPLPACKWV PQGPPPGHVG AAGQAFPSFE RGFRCRSRAS GLRSSLPSFR 60 
SWASPPPTH QSRCILGRAL GAMAPRGRKR KAEAAWAVA EKREKLANGG EGMEEAT WT 120 
EHCTS " 125 

(2) INFORMATION UBER SEQ ID NO: .58: 

(A) LANGE: 119 Aminosauren 

(B) TYP: Protein . 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(Ii) MOLEKOLTYP: ORF 
(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 58:' 

QRSPPPFPPR RSPASLASRL RRPPRPQPYA SSRGEPWRLE PGRECSGTGG WGAETRPLSG 60 
NWATKSAARK LCSYSGNLSQ RKGKLGPQHP RGLEADLGAQ PLCKQGAGRL EPNRLERLE 119 

(2) INFORMATION OBER SEQ ID NO: 59: 

(A) LANGE: 128 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 59: 

TRQLVQCSIT TVASSIPSPP FASFSRFSAT ATTAASALRF LPRGAMAPRA RPRMQRDWWV 60 
GGGDATTERK LGNEERSPEA LLLQRKPLSK EGKAWPAAPT WPGGGPWGTT TLQAGSGKAR12 0 
AESARAAR 128 

(2) INFORMATION UBER SEQ ID NO: 60: 

(A) LANGE: 127 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 60: 

VTVMQFNFEL SFKYVLYSSY- SWLKLDHTIA DCMVFTWTPC RMLDYLYSSY ANMLWAGEMK 60 
SSSHQDLLFK WLDNWATKEL ELHLLGFELF WNTLLHFGKS KSSASGALSI ENLPSFALKD120 
VLFFIYT 127 

(2) INFORMATION UBER SEQ ID NO: 61: 

(A) LANGE: 111 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 
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(ii) MOLEKOLTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 61: 

SIGPICSQGL GPGGIPSPIT LIKNGCNCKN PCLIYLQLCS HLQMYLLMLS CQVPMQRWRG 60 
LPLCGWGLWV WKDRYQKNA FKCTNLLINI RCLLKKKKKK KKRVGGVGCI G 111 

(2) INFORMATION UBER SEQ ID NO: 62: 

(A) LANGE: 68 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 62: 

YRSFTTTHKP HPHKGSPRHL CIGTWQLSIR RYICKWEHSC KYIRQGFLQL QPFLIKVIGE60 
GIPPGPRP 68 

(2) INFORMATION UBER SEQ ID NO: 63: 

(A) LANGE: 195 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 63: 

LVQPGGSCSG GRLLGVEFPS APRVRPFERS APAPATSLLG AMTTTTTFKG VDPNSRNSSR 60 
VLRPPGGGSN FSLGFDEPTE QPVRKNKMAS NIFGTPEENQ ASWAKSAGAK SSGGREDLES120 
SGLQRRNSSE ASSGDFLDLK GEGDIHENVD TDLPGSLGQS EEKPVPAAPV PSPVAPAPVP1 8 0 
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SRRNPPGGKS SLVLG 

(2) INFORMATION UBER SEQ ID NO: 64: 

(A) LANGE: 164 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 64: 

VSQSFPSNLL LENTHAMAHR PKSQGQRETC SSKEKKKRQQ YIKCFFLMKQ IQEMYSQAQV 60 
VQFTSMEETD RTTAFRTVRA NPRRGWTCRQ GDFFWMALGP GPPGWAQAQQ ARASLHSAPG120 
CLASLCPHFH EYHLLPSDLR SLRSLLQRSS FSAVQMTPSL PCHH 164 

(2) INFORMATION OBER SEQ ID NO: 65: 

(A) LANGE: 106 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 
HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 65: 

FQAVSLYIQA FLCVRAKALL ISQPVLLLSG YFLRLKNKRQ FLCFAGGKAG GAGLFIVHMS 60 
QEEALSKGHW QVRATPRRLC GETPCGLGPG RNGACGLFMV CPVEAW 106 

(2) INFORMATION UBER SEQ ID NO: 66: 

(A) LANGE: 349 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 
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(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 66: 

AALRS DAGMK RALGRRKGVW LRLRKILFCV LGLYIAIPFL IKLCPGIQAK LIFLNFVRVP 60 
YFIDLKKPQD QGLNHTCNYY LQPEEDVTIG VWHTVPAVWW KNAQGKDQMW YEDALASSHP12 0 
IILYLHGNAG TRGGDHRVEL YKVLSSLGYH WTFDYRGWG DSVGTPSERG MTYDALHVFD180 
WIKARSGDNP VYIWGHSLGT GVATNLVRRL CERETPPDAL ILESPFTNIR EEAKSHPFSV2 4 0 
IYRYFPGFDW FFLDPITSSG IKFANDENVK HISCPLLILH AEDDPWPFQ LGRKLYSIAA300 
PARSFRDFKV QFVPFHSDLG YRHKYIYKSP ELPRILREFL GKSEPEHQH 349 

(2) INFORMATION UBER SEQ ID NO: 67: 

(A) LANGE: 191 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 67: 

SGLSRLGPGR NQHAGQDVLC EVAAALHQVL KELLGQGIDY EKILKLTADA KFESGDVKAT 60 
VAVLSFILSS AAKHSVDGES LSSELQQLGL PKEHAASLCR CYEEKQSPLQ KHLRVCSLRM120 
NRLAGVGWRV DYTLSSSLLQ SVEEPMVHLR LEVAAAPGTP AQPVAMSLSA DKFQVLLAEL180 
KQAQTLMSSL G 191 

(2) INFORMATION UBER SEQ ID NO: 68: 

(A) LANGE: 164 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 68: 

FFFFFFFFFF FFFFSLLYFC LFFLLMKTAN NCLSREGKVM LGKVLRSPEP SSQERSEAAG 60 
DLGGQSPGQG LSILEPGLPP EEQFRGRDSI RAGRLHTGLE HPSPQPRELI RVWACFSSAR120 
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RTWNLSAERD MATGWAGVPG AAATSSRRCT MGSSTDC&RL: ELRV 164 

(2) INFORMATION UBER SEQ ID NO: 69: 

(A) LANGE: 155 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 69: 

NQGCLPKSSS EGVTPYGQGG STQAWNTLLL SPGSSSGSGP ASVLPGGPGT CLLRGTWQQA 60 
GLGSLGQLPP PAAGAPWALP RIAAGWSSGC SPPASPHLPT YSCVGCRPAS AS ARGEASPH12 0 
NSGTGWPRAL WAAPAAAVHW TRIRHRHCAW PHWRG 155 

(2) INFORMATION OBER SEQ ID NO: 70: 

(A) LANGE: 35 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 70: 

RRAAVTWVWL GVLCFESAVF TPTEWRTCR LLRFS 

(2) INFORMATION OBERSEQ ID NO: 71: 

(A) LANGE: 32 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) f OPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 
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(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xl) SEQUENZ-BESCHREIBUNG: SEQ ID NO 71: 



KRLTQNTTPP TRPKSQLHVF KTSFKVSYFS TS 



32 



(2) INFORMATION ClBER SEQ ID NO: 72: 

(A) LANGE: 37 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 72: 

ENRSNLHVLT TSVGVKTADS KHNTPNQTQV TAARLQN 37 

(2) INFORMATION OBER SEQ ID NO: 73: 

(A) LANGE: 121 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 73: 

LVKGMTVLEA VLEIQAITGS RLLSMVPGPA RPPGSCWDPT QCTRTWLLSH TPRRRWISGL 60 
PRASCRLGEE PPPLPYCDQA YGEELSIRHR ETWAWLSRTD TAWPGAPGVK QARILGF.T.T.L120 
V 121 

(2) INFORMATION UBER SEQ ID NO: 74: 

(A) LANGE: 115 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 
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(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 

(vi) HERKUNFT: 

(A) ORGANISMUS:,MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 74: 

QACPWASLAQ GQRTRLRRKL DTPVHGGLGL EGWLSGLEVP GGLPAGTRPS AAGWAVPCCC 60 
CPQGLAWAE DGTLSGWIRS PGSSSSRELR HKAGARLYTC RTQESLLQFL PEAPR 115 

(2) INFORMATION UBER SEQ ID NO: 75: 

(A) LANGE: 117 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 75: 

RWHLIRLDQV TRQQQLSRAE AQGRGPAVHL QDPGEPVAVL ARSAEIASSV SLQQEQNQLW 60 
PRWVGGSAFL AMAAATPRQE TAECLEGCNT RSNRQPPLET. MSDGQALQHL DRHGGWS 117 

(2) INFORMATION UBER SEQ ID NO: 76: 

(A) LANGE: 66 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 76: 

PPQERRTIEV LYPRGSGREN MESGFYRLIG PIHKGHDWEK VWEQKENWDF RVQYAHPKLL60 
VAWGMS 66 

(2) INFORMATION UBER SEQ ID NO: 77: 
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(A) LANGE: 81 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzei 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 77: 

ALSTRAMTGK RYGSRRRIGI LGCSTLTLNF WWPGACLEAQ TVKQALLACL LVTTSAPAVL60 
RLHPAPGTPP APEPPLSPCD G 81 

(2) INFORMATION OBER SEQ ID NO: 78: 

(A) LANGE: 104 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzei 

(D) TOPOLOGIE linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 78: 

TLLTIHIWTR DTVHEVHPSQ GDSGGSGAGG VPGAGWSLKT AGAEWTSKQ ASRACLTVWA 60 
SRHAPGHQKF RVSVLHPKIP ILLLLPYLFP VMALVDRAYQ SIES 104 

(2) INFORMATION UBER SEQ ID NO: . 79: 

(A) LANGE: 104 Aminosauren 

(B) TYP: Protein. 

(C) STRANG: einzei 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 79: 

PSSPRAVRHS GHQDSKMASV VPVKDKKLLE VKLGELPSWI LMRDFSPSGI FGAFQRGYYR 60 
YYNKYINVKK GSISGITMVL ACYVLFSYSF SYKHLKHERL RKYH 104 

(2) INFORMATION OBER SEQ ID NO: 80: 

(A) LANGE: 82 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 80: 

RRGASRGLPW CWHATCSIAT PFPTSISSTS GSANTTEEDT LCTPPPHDLG PSPSVRNTIS60 
IVAESFHILI GINLQIKHDW YV 82 

(2) INFORMATION OBER SEQ ID NO: 81: 

(A) LANGE: 115 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 81: 

KDSATIEIVF LTEGLGPRSW GGGVQSVSSS WFAEPLVLE MLVGKGVAKE HVACQHHGNP 60 
RDAPLLHIDV LVWPWTSn KRSENATRTE VPHQDPAWQL PQFDLQKFLV LHWYN 115 

(2) INFORMATION UBER SEQ ID NO: 82: 

(A) LANGE: 187 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
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b) eine allelische Variation der unter a) genannten Nukleinsaure-Sequenzen 
oder 

c) eine Nukleinsaure-Sequenz, die komplementar zu den unter a) oder b) genannten Nukleinsaure-Sequenzen 
ist. 

2. Eine Nukleinsaure-Sequenz gemaB einer der Sequenzen Seq. ID Nos. 1-50, oder eine komplementare oder alle- 
lische Van ante davon. 

3. Nukleinsaure-Sequenz Seq. ID No. 1 bis Seq. ID No. 50, dadurch gekennzeichnet dafi sie in Blasentumorge- 
webe erhoht exprimiert sind. 

4. BAC, PAC und Cosmid-Klone, enthaltend funktionelle Gene und ihre chromosomale Lokalisation, entsprechend 
den Sequenzen Seq. ID. No. 1 bis Seq. ID No. 50, zur Verwendung als Vehikel zum Gentransfer. 

5. Eine Nukleinsaure-Sequenz gemaB den Anspruchen 1 bis 4, dadurch gekennzeichnet, dafi sie eine 90%ige Ho- 
mologie zu einer humanen Nukleinsaure-Sequenz aufweist. 

6. Eine Nukleinsaure-Sequenz gemaB den Anspruchen 1 bis 4, dadurch gekennzeichnet, dafi sie eine 95%ige Ho- 
mologie zu einer humanen Nukleinsaure-Sequenz aufweist 

7. Eine Nukleinsaure-Sequenz, umfassend einen Teil der in den Anspruchen 1 bis 6 genannten Nukleinsaure-Se- 
quenzen, in solch einer ausreichenden Grofie, dafi sie mit den Sequenzen gemaB den Anspruchen 1 bis 6 hybridisie- 
ren. 

8. Ein Nukleinsaure-Sequenz gemaB den Anspruchen 1 bis 7, dadurch gekennzeichnet, dafi die Grofie des Frag- 
ments eine Lange von mindestens 50 bis 4500 bp aufweist 

9. Eine Nukleinsaure-Sequenz gemaB den Anspruchen 1 bis 7, dadurch gekennzeichnet, dafi die Grofie des Frag- 
ments eine Lange von mindestens 50 bis 4000 bp aufweist. 

10. Eine Nukleinsaure-Sequenz gemaB einem der Anspriiche 1 bis 9, die mindestens eine Teilsequenz eines biolo- 
gisch aktiven Polypeptids kodiert. 

11 . Eine Expression skassette, umfassend ein Nukleinsaure-Fragment oder eine Sequenz gemaB einem der Ansprii- 
che 1 bis 9, zusammen mit mindestens einer Kontroll- oder regulatorischen Sequenz. 

12. Eine Expressionskassette, umfassend ein Nukleinsaure-Fragment oder eine Sequenz gemaB Anspruch 11, wo- 
rin die Kontroll- oder regulatorische Sequenz ein geeigneter Promotor ist. 

1 3 . Eine Expressionskassette gemaB einem der Anspriiche 1 1 und 1 2, dadurch gekennzeichnet, dafi die auf der Kas- 
sette behndlichen DNA-Sequenzen ein 

Fusionsprotein kodieren, das ein bekanntes Protein und ein biologisch aktives Polypeptid-Fragment umfaBt 

14. Verwendung der Nukleinsaure-Sequenzen gemaB den Anspruchen 1 bis 10 zur Herstellung von Nfollangen-Ge- 
nen. 

15. Ein DNA-Fragment, umfassend ein Gen, das aus der \ferwendung gemaB Anspruch 14 erhaltlich ist 

16. Wirtszelle, enthaltend als heterologen Teil ihrer exprimierbaren genetischen Information ein Nukleinsaure- 
Fragment gemaB einem der Anspriiche 1 bis 10. 

17. Wirtszelle gemaB Anspruch 1 6, dadurch gekennzeichnet, dafi es ein prokaryontisches oder eukaryontische Zell- 
system ist 

18. Wirtszelle gemaB einem der Anspriiche 16 oder 17, dadurch gekennzeichnet dafi das prokaryontische Zellsy- 
stem E. coli und das eukaryontische Zellsystem ein tierisches, humanes oder Hefe-Zellsystem ist 

19. Ein Verfahren zur Herstellung eines Polypeptids oder eines Fragments, dadurch gekennzeichnet dafi die Wirts- 
zellen gemaB den Anspriichen 16 bis 18 kultiviert werden. 

20. Ein Antikorper, der gegen ein Polypeptid oder ein Fragment gerichtet ist welches von den Nukleinsauren der 
Sequenzen Seq. ID No. 1 bis Seq. ID No. 50 kodiert wind, das gemaB Anspruch 19 erhaltlich ist 

21 . Ein Antikorper gemaB Anspruch 20, dadurch gekennzeichnet daB er monoklonal ist. 

99 : Ttin Anrikorror ge maB An spruch 20 dadurch gekennzeichnet d a fi er ein Phage-Display- Antikorper ist 

24. Polypeptid-Teilsequenzen gemaB Anspruch 22, mit mindestens 80%iger Homologie zu diesen Sequenzen. 

25. Ein aus einem Phage-Display hervorgegangenen Polypeptid, welches an die Polypeptid-Teilsequenzen gemaB 
Anspruch 24 binden kann. 

26. Polypeptid-Teilsequenzen gemaB Anspruch 22, mit mindestens 90%iger Homologie zu diesen Sequenzen. 

27. Verwendung der Polypeptid-Teilsequenzen gemaB den Sequenzen Seq. ID No. 51-106, als Tools zum Auffin- 
den von WirkstofFen gegen den Blasentumor. 

28. Verwendung der Nukleinsaure-Sequenzen gemaB den Sequenzen Seq. ID No. 1 bis Seq. ID No. 50 zur Expres- 
sion von Polypeptiden, die als Tools zum Auffinden von WirkstofFen gegen den Blasentumor verwendet werden 
konnen. 

29. Verwendung der Nukleinsaure-Sequenzen Seq. ID No. 1 bis Seq. ID No. 50 in sense oder antisense Form, 

30. Verwendung der Polypeptid-Teilsequenzen Seq. ID No. 51-106 als Arzneunittel in der Gentherapie zur Be- 
handlung des Blasentumors. 

31. Verwendung der Polypeptid-Teilsequenzen Seq. ID No. 51-106, zur Herstellung eines Arzneimittels zur Be- 
handlung gegen den Blasentumor. 

32. Arzneimittei, enthaltend mindestens eine Polypeptid-Teilsequenz Seq. ID No. 51-106. 

33. Eine Nukleinsaure-Sequenz gemaB den Anspruchen 1 bis 10, dadurch gekennzeichnet dafi es eine genomische 
Sequenz ist 

34. Eine Nukleinsaure-Sequenz gemaB den Anspruchen 1 bis 10, dadurch gekennzeichnet dafi es eine mRNA-Se- 
quenzist. 

35. Genomische Gene, ihre Promotoren, Enhancer, Silencer, Exonstruktur, Intronstruktur und deren SpleiBvarian- 
ten, erhaltlich aus den cDNAs der Sequenzen Seq. ED No. 1 bis Seq. ID No. 50. 

36. Verwendung der genomischen Gene gemaB Anspruch 33, zusammen mit geeigneten regulativen Elementen. 
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37. Verwendung gemaB Anspruch 34, dadurch gekennzeichnet, daB das regulative Element ein geeigneter Promo- 
tor und/oder Enhancer ist 

38. Eine Nukleinsaure-Sequenz gemaB den Anspruchen 1 bis 7, dadurch gekennzeichnet, daB die GroBe des Frag- 
ments eine Lange von mindestens 300 bis 3500 bp aufweist 

Hierzu 10 Seite(n) Zeichnungen 
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ZEICHNUNGEN SEITE 1 



Nummen 
Int CI. 6 : 
Offenlegungstag: 



DE19818619A1 
C07K 16/00 

28. Oktober 1999 



Systematische Gen-Suche in der Incyte LifeSeq Daterv 



Normalgewebe \ /• Tumorgewebe 
-50.000 einzelne ESTs J r~56.006 einzelne ESTs , 



Prioritdtsliste 

hoch 

Prostata 
Brust 
Eierstock 
I f Blase 

Gebarmutter 

j nledilg 





Iterative 
Assemblierung 

mlt 
steigendem 
Mismatch 




-8.000 Contigs \ / -8.000 Contigs 

+ ) f ■ + 

-25.000 Singletons J V-25.000 Singletons 



normalgewebsspezi 
(erwartet: 100- 



500) 



Verglelch der 
Datenbanken 



unspezifisch 
expremierte 
Gene 





tumprgewebsspezifisch 
(drwartet: 100-500) 



Gene von Interesse 



Fig. 1 
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28. Oktober 1999 

Prinzip der EST-Assemblierung 



-50.000 ESTs pro Gewebe 




5000-6000 Contigs -25.000 ubrige Singletons 

V 

-30.000 Konsensus- 
sequenzen pro Gewebe 

Fig. 2a 
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ZEICHNUNGEN SEJTE3 



Nummen 
Int CI. 6 : 
Offenlegungstag: 



DE 198 18 619 A1 
C07K 16/00 

28. October 1999 



-50.000 ESTs 
eines Gewebes 
(z.B.: Blase Tumor) 



GAP4 assembly 1 . Runde: 

minimum intticd match: 20 
maximum pads per read: 8 
maximum percent mismatch: 0 




GAP4-Datenbank 1 



Icontigs 1 



Singletons 1 , , 



nicht 
assemblierte 
ESTs 



GAP4 assembly 2. Runde: 
minimum initial match: 20 
maximum pads per read: 8 
maximum percent mismatch: 1 




GAP4-Datenbank 2: 



[Contigs 2 



Stogletons^ 



nicht 
assemblierte 
ESTs 



GAP4 assembly 3. Runde: 

minimum initial match: 20 
maximum pads per read: 8 
maximum percent mismatch: 2 




GAP4-Datenbank 3: 



Conflgs 3 Singletons 3 



nicht 
assemblierte 
ESTs 

V V ^ 



Fig. 2b1 
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GAP4-Datenbank3: 



ContigsV) (^ngletons^ 




nlcht 
assemblierte 
ESTs 



Consensus 3 



GAP4 assembly 4. Runde: 
minimum initial match: 20 
maximum pads per read: 8 
maximum percent mismatch: 2 




GAP4-Datenbank 4: 




nicht 
assemblierte 
ESTs 



Consensus 4 j 



GAP4 assembly 5. Runde: 

minimum initial match: 20 
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(iii) HYPOTHETISCH: ja 

RPST AVAILABLE COPY 

(vi) HERKUNFT: PB©I ftvn,unw 

(A) ORGANISMUS: MENSCH 



Kxi) SEQUENZ-BESCHREIBUN 

ARAARGARRT SRAVTPTCAT PAGPMPCSRL PPSLRCSLHS ACCSGDPASY RLWGAPLQPT 60 
LGWPQASVP LLTDLAQWEP VLVPEAHPNA SLTMYVCTPV PHPDPPMALS RTPTRQISSS120 
DTDPPADGPS NPLCCCFHGP AFSTLNPVLR HLFPQEAFPA HPIYDLSQVW SWSPAPSRG180 
\Q 187 

(2) INFORMATION UBER SEQ ID NO: 83: . 

(A) LANGE:241 Aminosauren 

(B) TVP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 



(2) INFORMATION OBER SEQ ID NO: 84: 

(A) LANGE: 113 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 84: 



10 



IS 



20 



25 



(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja so 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 35 
(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 83: 

FFFFGLSNRC LLRAYAVLRL PFREPHECEA WPLPPGLQAP SLETPRNSRR LLSSSSTQST 60 «> 
SSQPLLGPPE CLSPAGCGGH HGPDLAQVID GVGREGFLGE EVPEHRVKGG ECWAMETAAE120 
RVGGAICRRI CVTRAOLPGG SPGEGHGRVR VGHRGADIHG QTCVRMCLRN QDRLPLGQVC18 0 
EEWHRGLGHH TQCGLQRGPP EPIAGRVPRA AGRVQGAAQG WRQPAAGHGP RWRCTSRCHS24 0 
T 241 



45 



SO 



55 



60 



65 
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MGWAGKASWG KRCLSTGLRV ENAGPWKQQQ RGLEGPSAGS" 3VSLELICBV GVLERAMGGS 60 
GWGTGVQTYM VRLALGCASG TRTGSHWARS VRSGTEAWGT" TPSVGOSGAP QSL 113 

(2) INFORMATION UBER SEQ ID NO: 85: 

(A) LANGE: 107 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEK0LTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 85: 

AFLSFLFSER FKRSTTLFPP SLLNLICTKS FALVGWETA LSLSTSVREC EPPWQVPVQG 60 
PAALHLGRVT GAPAVCPBCAS PWPFGLSLGR FRTEHQGRQA FQGISIN 107 

(2) INFORMATION UBER SEQ ID NO: 86: 

(A) LANGE: 107 Aminosauren 

(B) TYP: Protein 
(0) STRANG: einzel 
(D)TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 86: 

LRNRLWKVKL EEPDLISPTS KTPSEQQRPQ HPPRTGDSIF MATPCGGRLT TSHHIIPELS 60 
SSSGMTPSPP PPSSSFSSFC LFVSELSCLS FFLRDSKPPR LCFPRPF 107 

(2) INFORMATION UBER SEQ ID NO: 87: 

(A) LANGE: 115 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 
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(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 87: 

IQKGRGKQSR GGFESLRKKE RQESSETKRQ KDEKEEEGGG GDGVIPLEEL SSGMMWWLW 60 
NRPPQGVAMK MESPVRGGCC GRCCSDGVFD VGLMRSGSSS FTFQSRFLSQ WQWL 115 

(2) INFORMATION UBER SEQ ID NO: 88: 

(A) LANGE: 124 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEK0LTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 88: 

CSVRNLPRLR PKGQGEAFGH TAGAPVTRPR WRAAGPCTGT CQGGSHSRTL VLRLKAVSTT 60 
PTSANDLVQI KFRRDGGNRV VEALNLSEKR KDRKAQKQRD RRMKKKKREV VGTASSRWRS120 
SALG 124 

(2) INFORMATION UBER SEQ ID NO: 89: 

(A) LANGE: 198 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 89: 

EGAGGEWRCP AAGGPRGEDG PPGLRLTERA GLTTRHLTGT ADPSQKHNLR DPSSSKDERI 60 
QQTQPNREGA CSLNLHGLRY LCKPGVAPVL RVILRSCLFP NGFTSGSCRF SLGLSLILKW120 
VAGGRGWLRP LLALRLQSWE QDTSPEFHFF SCPNHAHTIV QNQSTFEKWL HGHPPGPRKL180 
HSKGLFTWQQ NPSPAVSP 198 

(2) INFORMATION UBER SEQ ID NO: 90: 
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(A) LANGE: 124 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 90: 

LPPVEPPVLK GSCRKOMHRN EERERSENEV WBARPGPTAQ GSSPPPDAPF HPPPQCLLSP 60 ( 
KAPASEVLGA HPPQCGQGGK GQVLDTAKCP EMTLLLTHFF GPWQSPTCPQ HGAPGRTGRQ1 2 0 
EGGW 124 

(2) INFORMATION UBER SEQ ID NO: 91: 



(A) LANGE: 147 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 91: 

NSGEVSCSQL CSLRASRGRS HPLPPATHFK MRLRPRLKRQ LPEVNPFGKR HERRMTLRTG 60 
ATPGLHKYRR PWRLRLQAPS LLGCVCCILS SLELLGSLRL CFWDGSAVPV RCLWRPALS120 
VSLSPGGPSS PLGPPAAGHL HSPPAPS 147 

(2) INFORMATION UBER SEQ ID NO: 92: 

(A) LANGE: 374 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ iD NO 92: 

SREAPESRRW AVWRSLESLP RHQLLCLPVG APPAPAMLSA LARPASAALR RSFSTSAQNN 60 
AKVAVLGASG GIGQPLSLLL KNSPLVSRLT LYDIAHTPGV AADLSHIETK AAVKGYLGPE120 
QLPDCLKGCD WVIPAGVPR KPGMTRDDLF NTNATIVATL TAACAQHCPE AMICVIANPV180 
NSTIPITAEV FKKHGVYNPN K1FGVTTLDI VRANTFVAEL KGLDPARVNV PVIGGHAGKT2 4 0 
IIPLISQCTP KVDFPQDQLT ALTGRIQEAG TEWKAKAGA GSATLSMAYA GARFVFSLVD300 
AMNGKEGWE CSFVKSQETE CTYFSTPLLL GKKGIEKNLG IGKVSSFEEK MISDAIPELK360 
ASIKKGEDFV KTLK 374 

(2) INFORMATION OBER SEQ ID NO: 93: 

(A) LANGE: 238 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 93: 

LNEGTFHNTF LSIHCIHKGE DKAGAGIRHG EGGRACSGFS LDHLRAGLLD PPSECCQLVL 60 
GKVHLGGALR DQGDDGLPSM ATNDRDVDSS WIQTLQLCNK GVGSDDVQGR HAEDFVGWH120 
SMLLENFCCD GDGGINRIGN DADHGFRAVL GTGSGQGGEN RGIGVEQVVP GHAWLSGDSS180 
RNNYHITTFQ AVRQLFRSEV AFHSGFGLDV AQICGHSGCV RDIIEGQAAH QGAVLQEK 238 

(2) INFORMATION UBER SEQ ID NO: 94: 

(A) LANGE: 242 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 94: 

EAGEEKAEEE GVAEEEGVNK FSYPPSHREC CPAVEEEDDE EAVKKEAHRT STSALSPGSK 60 
PSTWVSCPGE EENQATEDKR TERSKGARKT SVSPRSSGSD PRSWEYRSGE ASEEKEEKAH120 
KETGKGEAAP GPQSSAPAQR PQLKSWWCQP SDEEEGEVKA LGAAEKDGEA ECPPCIPPPS180 
AFLKAWVYWP GEDTEEEEDE EEDEDSDSGS DEEEGEAEAS SSTPATGVFL' KSWVYQPGED240 
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(2) INFORMATION UBER SEQ ID NO: 95: 

(A) LANGE: 237 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 95: 

RPGCIGQERT LDSEDKEDDS EAALGEAESD PHPSHFDQRA HFRGWGYRPG KETEEEEAAE 60 
DWGEAEPCPF RVAIYVPGEK PPPPWAPPRL PLRLQRRLKR PETPTHDPDP ETPLKARKVR120 
FSEKVTVHFL AVWAGPAQAA RQGPWEQLAR DRSRFARRIT QAQEELSPCL TPAARARAWA180 
RLRNPPLAPI PALTQTLPSS SVPSSPVQTT PLSQAVATPS RSSAAAAAAL DLSGRRG 237 

(2) INFORMATION UBER SEQ ID NO: 96: 

(A) LANGE: 890 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 96: 

QDEHLITFFV PVFEPLPPQY FIRWSDRWL SCETQLPVSF RHLILPEKYP PPTELLDLQP 60 
LPVSALRNSA FESLYQDKFP FFNPIQTQVF NTVYNSDDNV FVGAPTGSGK TICAEFAILR120 
MLLQSSEGRC VYITPMEALA EQVYMDWYEK FQDRLNKKW LLTGETSTDL KLLGKGNIII180 
STPEKWDILS RRWKQRKNVQ NINLEWDEV HLIGGENGPV LEVICSRMRY ISSQIERPIR240 
IVALSSSLSN AKDVAHWLGC SATSTFNFHP NVRPVPLELH IQGFNISHTQ TRLLSMAKPV300 
YHAITKHSPK KPVIVFVPSR KQTRLTAIDI LTTCAADIQR QRFLHCTEKD LIPYLEKLSD360 
STLKETLLNG VGYLHEGLSP MERRLVEQLF SSGA1QWVA SRSLCWGMNV AAHLVIIMDT420 
QYYNGKIHAY VDYPIYDVLQ MVGHANRPLQ DDEGRCVTMC QGSKKDFFKK FLYEPLPVES4 8 0 
HLDHCMHDHF NAEIVTKTIE NKQDAVDYLT WTFLYRRMTQ NPNYYNLQGI SHRHLSDHLS540 
ELVEQTLSDL EQSKCfSIED EMDVAPLNLG MlAAYYYINY TTIELFSMSL NAKTKVRGLI 600 
EIISNAAEYE NIPIRHHEDN LLRQLAQKVP HKLNNPKFND PHVKTNLLLQ AHLSRMQLSA660 
ELQSDTEEIL SKAIRLIQAC VDVLSSNGWL SPALAAMELA QMVTQAMWSK DSYLKQLPHF720 
TSEHIKRCTD KGVESVFDIM EMEDEERNAL LQLTDSQIAD VARFCNRYPN IELSYEWDK7 8 0 
DSIRSGGPW VLVQLEREEE VTGPVIAPLF PQKREEGWWV VIGDAKSNSL ISIKRLTLQQ840 
KAKVKLDFVA PATGAHNYTL YFMSDAYMGC DQEYKFSVDV KEAETDSDSD 890 

(2) INFORMATION UBER SEQ ID NO: 97: 
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(A) LANGE: 281 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIB linear 

(ii) MOLEKULTYP: ORF 
i) HYPOTHETISCH: ja ' 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 97: 

GDGSAEHGPR PLAAPLVTSR GAPASARPRG ALPGGSAPSA PHGQLPGRAQ PAPVSGPPPT 60 
SGLCHFDPAA PWPLWPGPWQ LPPHPQDWPA QPDIPQDWVS FLRSFGQLTL C PRNGTVTGK1 2 0 
WRGSHWGLL TTLNFGDGPD RNKTRTFQAT VLGSQMGLKG SSAGQLVLIT ARVTTERTAG1 8 0 
TCLYFSAVPG ILPSSQPPIS CSEEGAGKAT LSPRMGEECV SVWSHEGLVL TKLLTSEELA2 4 0 
LCGSRLLVLG SFLLLFCGLL CCVTAMCFHP RRESHWSRTR L 281 

(2) INFORMATION UBER SEQ ID NO: 98: 

(A) LANGE: 206 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEK0LTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 98: 

RLEPRSVTRS RRAVSRLSAR PGKVSAVMAF LASGPYLTHQ QKVLRLYKRA LRHLESHCVQ 60 
RDKYRYFACL MRARFEEHKN EKDMAKATQL LKEAEEEFWY RQHPQPYIFP DSPGGTSYER120 
YDCYKVPEWC LDDWHPSEKA MYPDYFAKRE QWKKLRRESW EREVKQLQEE TPPGGPLTEA1 8 0 
LPPARKEGDL PPLWWYIVTR PRERPM 206 

(2) INFORMATION UBER SEQ ID NO: 99: 

(A) LANGE: 139 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 
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(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 99: 

PLVPSFPSAV SSTVLSWQSN QDTLPSQKDA SHLSTILGPC SNRISHRRCP QESQGRCMAV 60 
DADGTRILPR PPSAAGWPSP YPFHSYVLQT GLSSNKQSIG ICLSGRTTTR GGVAPAYKAA120 
TPFADGSGRV PTPRTPLRR 139 

(2) INFORMATION UBER SEQ ID NO: 100: 

(A) LANGE: 79 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 100: 

APFWDLVAIV SLIGGAPRRV REDVWLWMLT VPEFFLGLLQ QLGGLRHILF ILMFFKPGSH 60 
QTSKVSVFVS LDAPRLEVA 79 

(2) INFORMATION UBER SEQ ID NO: 101: 

(A) LANGE: 89 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 101: 

VGGACAVALP QAAAMAGQED PVQREIHQDW ANREYIEIIT SSIKKIADFL NSFDMSCRSR60 
LATLNEKLTA LERRIEYTEA RVTKGETLT "89 

(2) INFORMATION 0BER SEQ ID NO: 102: 
(A) LANGE: 88 Aminosauren 
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(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 102: 

NSAVLLLGSC FTQHRPHGKG PSSLQLLPFS LKSNRAYSCF SFFKSVAFGL CHLGCGWCG60 
KKFRGTVGND VRHFTFSVTF YTSTCQCI -88 

(2) INFORMATION UBER SEQ ID NO: 103: 

(A) LANGE: 89 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 
HYPOTHETISCH: ja 



(vl) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 1 03: 

HFIHLLVNVF ETFTAKSLGL FVKVLLTSIF LSLSLKLSLK FSLPLHCFCE QSLS PPHLWW60 
EVRGSPGQDT HALRMWLPEN WANPPVSC 89 

(2) INFORMATION OBER SEQ ID NO: 104: 

(A) LANGE: 240 Aminosauren . 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
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(xl) SEQUENZ-BESCHREIBUNG: SEQ ID NO 104: 

REQILFIEIR DTAKGGETEQ PPSLSPLHGG RMPEMGEGIQ SLARETQSHR GRRQGWDATW 60 
VTRCRESLNR GGAGAGKRAG ALAHHVFLAL IEPNLAEREA SEEEVKACSD ETWADLLVKl 2 0 
WYVLGAILK IFLREGNVLN QHSGMDIEKY SEHYQHDHSP GAEDDAAGGQ LRPTAQERRH18 0 
KEGSRGSPRC KRARKAVGES PGCPRRGAEG AWPRPQPSHP GAAGGKGWAP LRTFAGDGAV24 0 

(2) INFORMATION UBER SEQ ID NO: 105: 

(A) LANGE:136 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 105: 

RLYMFWGPFS KSFSVKGTSS INTAEWTLKN IPSIINMITP QALRTMPQAA SFGPQHRSDA 60 
IKKGVGDRRG ASGLGKRWEK AQDALAGGQR GRGPGLNHPI RGRQAEKAGL LSGLSRETAP12 0 
SETKTAPGET FLDLCS 136 

(2) INFORMATION UBER SEQ ID NO: 106: 

(A) LANGE: 173 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 1 06: 

LQRSRKVSPG AVLVSDGAVS. RESPERSPAF SACRPRMGWL RPGPRPLCPP ARASWAFSHR 60 
FPSPLAPRRS PTPFFMASLL CCGPKIAACG IVLSAWGVIM LIMLGIFFNV HSAVLIEDVP120 
FTEKDFENGP QNIYNLYEQV SYNCFIAAGL YLLLGGFSFC QVRLNKRKEY MVR 173 



Patentanspriiche 

1 . Eine Nukleinsaure-Sequenz, die ein Genprodukt oder ein Teil davon kodieit, umfassend 

a) eine Nukleinsaure-Sequenz, ausgewahlt aus der Gruppe Seq. ID No. 2-5, 7-13, 16, 18, 20, 23, 26-27, 
31-32,36,45. 
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High-throughput technologies, such as proteomic screening and ONA micro-arrays, produce vast 
amounts of data requiring comprehensive analytical methods to decipher the biologically relevant 
results. One approach would be to manually search the biomedical literature; however, this would be 
an arduous task. We developed an automated literature-mining tool, termed MedGene, which 
comprehensively summaries and estimates the relative strengths of all human gene-disease 
relationships in Medline. Using MedGene, we analyzed a novel micro-array expression dataset 
comparing breast cancer and normal breast tissue in the context of existing knowledge. We found no 
correlation between the strength of the literature association and the magnitude of the difference in 
expression level when considering changes as high as 5-fold; however, a significant correlation was 
observed (r = 0.41; p = 0.05) among genes showing an expression difference of 10-fold or more. 
Interestingly, this only held true for estrogen receptor (ER) positive tumors, not ER negative. MedGene 
identified a set of relatively understudied, yet highly expressed genes in ER negative tumors worthy of 
further examination. 

Keywords; bioinformatics • micro-array • text mining • gene-disease association » breast cancer 



Introduction 

At its current pace, the accumulation of biomedical, literature 
outpaces the ability of most researchers and clinicians to stay 
abreast of their own immediate fields, let alone cover a broader 
range of topics. For example, to follow a single disease, e.g., 
breasr cancer, a researcher would have had to scan 130 different 
journals and read 27 papers per day in 1999. 1 This problem is 
accentuated with high-throughput technologies such as DNA 
micro-arrays and proteornics. which require the analysis of 
large dataseis involving thousands of genes, many of which are 
unfamiliar to a particular researcher. In any microarray experi- 
ment, thousands of genes may demonstrate statistically sig- 
nificant expression changes, but only a fraction of these may 
be relevant to the study. The ability to interpret these datasets 
would be enhanced if. they could be compared to a compre- 
hensive summary of what is known about all genes. Thus, there 
is a need to summarize existing knowledge in a format that 
allows for the rapid analysis of associations between genes and 
diseases or other specific biological concepts. 

'One solution to this problem is to compile structured digital 
resources, such as the Breast Cancer Gene Database* and the 
Tumor Gene Database. 2 However, as these resources are hand- 
curated. the labor-intensive review process becomes a rate- 
limiting step in the growth of the database. As a result, these 
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databases have a limited scale and the genes are not selected 
in a systematic fashion. 

An alternative approach is automated text mining; a method 
which involves automated information extraction by searching 
documents for text strings and analyzing their frequency and 
context. This approach has been used successfully in several 
instances for biological applications. In most cases, it has been 
applied to extract information about the relationships or 
interactions thai proteins or genes have with one another, in 
the literature or by functional annotation. 3 ' 7 Thus far, few 
publication have applied text-mining to examine the global 
relationships between genes and diseases. Perez-lratxeta et al. 
automatically examined the GO (Gene Ontology} annotation 
of genes and their predicted chromosomai locations in order 
to identify genes linked to inherited disorders. 8 

To obtain a more global understanding of disease develop- 
ment, it would be valuable to incorporate information regarding 
all possible gene-disease relationships, including biochemical, 
physiological, pharmacological, epidemiological, as well as 
genetic. This information would enable comprehensive com- 
parisons between large experimental datasets and existing 
knowledge in the literature. This would accomplish two things. 
First. It would serve to validate experiments by demonstrating 
thai known responses occur as predicted. Second, it would 
rapidly highlight which genes are corroborated by the literature 
and which genes are novel in a given context. We have utilized 
a computational approach to literature mining to produce a 
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comprehensive set of gene-disease relationships. In addition, 
we have developed a novel approach to assess the strength of 
each association based on the frequency of citation and co- 
citation. We applied this tool to help interpret the data from a 
large micro-array gene expression experiment comparing 
normal and cancerous breast tissue. 

Methods 

MedGene Database. MedCene is a relational database, stor- 
ing disease and gene information from NCBI, text mining re- 
sults, statistical scores, and hyperlinks to the primary lit- 
erature. MedGene has a web- based user interface for users to 
query the database (http://hipseq. med.harvard.edu/MedGene/). 

Text Mining Algorithms. MeSH files were downloaded from 
the MeSH web site at NLM (Nation Library of Medicine) fhttp:// 
www.nlm.nih.gov/mesh/meshhome.html) and human disease 
categories were selected. LocusLink files were downloaded from 
the LocusLink web site at NCBI (http://www.ncbi.nih.gov/ 
LocusUnk/). Official/preferred gene symbol, official/ preferred 
gene name, and gene alternative symbols and names, all 
relevant annotations and UHls for each LocusUnk record, were 
collected. Gene search terms were used for literature searching 
and included aJ! qualified gene names, gene symbols, and gene 
family terms. Primary gene keys, predominantly qualified gene 
family terms and gene official/preferred symbols, were used 
to index Medline records. If the official /preferred gene symbols 
did not meet the standards to be an index, then qualified gene 
official /preferred names were used. A local copy of Medline 
records (up to July, 2002) was pre-selected. 

A JAVA module examined the MeSH terms and then indexed 
each Medline record with the appropriate disease terms. A 
separate JAVA module was used to examine the titles and 
abstracts for gene search terms and then to index the gene- 
related Medline records with the relevant primary gene key(s). 

Statistical Methods. For every gene and disease pair, we 
counted records that were indexed for both gene arid disease 
(double positive hits), for disease only (disease single hits), for 
gene only (gene single hits), and for neither gene nor disease 
(double negative hits) to generate a 2 x 2 contingency table. 
On the basis of the contingency table-framework, we applied 
different statistical methods to estimate the strength of gene- 
disease relationships and evaluated the results. These methods 
included chi-square analysis, Fisher's exact probabilities, rela- 
tive risk of gene, and relative risk of disease 1 * (hup;// 
hipseq.med.harvard.edu/MedGene/). In addition, we computed 
the "product of frequency", which is the product of the 
proportion of disease/gene double hits to disease single hits 
and the proportion of disease/gene double hits to gene single 
hits. To obtain a normal distribution, we transformed all the 
statistical scores using the natural logarithm. We selected the 
log of the product of frequency (LPF) to validate MedCene and 
to use for the analysis with the micro- array data. Spearman 
rank-correlation coefficients were used to assess the linear 
relationship, between LPF and micro- array fold change in 
expression level. . 

Global Analysis, Diseases with at least 50 related genes were 
selected for clustering analysis, and the LPF scores were 
normalized with total score for each disease. Hierarchical 
clustering was done with the "Cluster" software and the 
clustering result was visualized using "TreeViewer" (htrp:// 
rana.lbl.gov/EisenSoftvvare.htin), 
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Breast Tissue Micro-Arrays. Eighty-nine breast cancer 
samples (79% ER-positive) and 7 normal breast tissue samples 
were selected from the Harvard Breast SPORE frozen tissue 
repository and were representative of the spectrum of histo- 
logical types, grades, and hormone receptor immuno-pheno- 
types of breast cancer. Biotinyiated cRNA, generated from the 
total RNA extracted from the bulk tumor, was hybridized to 
Affymetrix U95A oligonucleotide micro-arrays. These micro- 
arrays consist of 12 400 probes, which represent approximately 
9000 genes. Raw expression values were obtained using GENE- 
CHIP software from Alfymetrix, and then further analyzed using 
the DNA-Chip Analyzer (dChip) custom software. 

Results 

Automated Indexing of Medline Records by Disease and 
Gene. To study the gene-disease associations in the literature, 
we first compiled complete lists for human diseases and human 
genes. To index all Medline records that were relevant to 
human diseases, the Medical Subject Heading (MeSH) index 
of Medline records was utilized. MeSH is a controlled medical 
vocabulary from the National Library of Medicine and consists 
of a set of terms or subject headings that are arranged in both 
an alphabetic and an hierarchical structure. Medline records 
are reviewed manually and MeSH terms are added to each with 
software assistance. 910 Twenty-three human disease category 
headings along with all of their child terms (see the Supporting 
Information, Supplemental Table 1. or visit http://hipseq. 
med.harvard.edu/MedCene/publication/s_Table 1 .html) were 
selected from the 2002 MeSH index creating a list of 4033 
human diseases. 

No index comparable to the MeSH index exists- for genes, 
and thus, it was necessary to apply a string search algorithm 
for gene names or symbols found in Medline text. A complete 
list of genes, gene names, gene symbols, and frequently used 
synonyms were collected from the LocusLink database at 
NCBI. 11 ' 12 which contains 53 259 independent records keyed 
by an official gene symbol or name (June 18 th , 2002). For the 
purposes of this study, no distinction was made between genes 
and their gene products. Authors often use the same name for 
both, differentiating the two only by the use of italics, if at all. 
For the intended use of this study, mis lack of distinction is 
unlikely to have a large effect and may in fact be beneficial. 

Initial attempts to search the literature using these lists 
revealed several sources of false positives and false negatives 
(Table I). False positives primarily arose when the searched 
term had other meanings, whereas false negatives arose from 
syntax discrepancies necessitating the development of filters 
to reduce these errors. The syntax issues were readily handled 
by including alternate syntax forms in the search terms. The 
false positive cases, caused by duplicative and unrelated 
meanings for the terms, were more difficult to manage. Where 
possible, case sensitive string mapping reduced inappropriate 
citations, in many cases, however, this was not sufficient and 
the terms had to be eliminated entirely, thereby reducing the 
false positive rate but unavoidably under-representing some 
genes. 

For the purposes of data tracking, a primary gene key was 
selected to represent all synonyms that correspond to each 
gene, Medline records were indexed with a primary gene key 
when any synonym for that key was found in the title or 
abstract. Case- insensitive string mapping was used for all 
searches except as noted above. No additional weight was 
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Table 1. Systematic Sources of False Positives and False Negatives in Unfiltored Data" 



source of error 



error rype 



example 



gene symbol/name 
is not unique 



gene symbol is 

unrelated abbreviation 
gene symbol/name 

has language meaning 
nonstandard syntax 
unofficial gene name/symbol 
nonspeclfied gene name 



false positive MAC— myelin 

associated glycoprotein 
M/iC—malignancy-associated 
protein 

false positive f/t — pallid homologue (mouse), 

pallidin (also abbrev. for Pennsylvania) 

false positive WA 5- Wiskott-Aldrich Syndrome 

(also the word "was") 

false negative RAG- 1 instead of BACl 

fa I se n ega t i ve P53 i n st ead o f TP 5 3 

false negative estrogen receptor instead of 
Estrogen receptor 1 



fitter solution 



eliminate this term 

eliminate this term 

case-sensitive suing search 

add dash term 

add all gene nicknames 

add family stem term 



" In preliminary studies. Medline was searched for co-occurrence of genes and diseases and (he resulting output was evaluated to identify error sources that 
were amenable tu global niters. Each error source is categorized by the type of error it causes: false positives are suggested relationships that are not real and 
false negatives are real relationships that are unde represented. The filter solutions used are indicated. Note that in some cases, the filter solution itself Introduces 
error. In general, error rates maximized sensitivity, even at the expense of specificity if needed. 



added for multiple occurrences of a term or the co-occurrence 
of multiple synonyms for the same gene key. 

Medline records were searched with all qualified gene 
identifiers, such as the official/preferred gene symbol, the 
official/preferred gene name, all gene nicknames and all syntax 
variants. In situations where there are several members of a 
gene family or splice variants, some authors prefer to use a 
shortened gene family name, e.g.. estrogen receptor instead of 
estrogen receptor 1 (ESRi), creating a source of false negatives. 
For this reason, gene family stein terms were created for all 
genes that have an alpha or numerical suffix {e.g., IL2RA, IVFfi. 
ESR1> etc.) and then used to search the literature. The family 
stem terms were handled separately from the specific gene 
names so that it would be clear when linkages were made to 
the gene family versus a specific member in that family. 

To improve performance and accuracy, some pie-selection 
was applied to the records that were scanned. First, review 
articles were eliminated to avoici redundant treatment of 
citations. Second, non- English journals were removed because 
the natural language filters were only relevant to English 
publications. Finally, journals unlikely to contain primary data 
about gene-disease relationships were also removed (e.g.. Int. 
j. Health Educ, Bedside Nurse, and / Health Econ.). Together, 
these filters reduced the 12 198 221 Medline publications (July 
2002) by 37%. 

Ranking the Relative Strengths of Gene-Disease Associa- 
tions. In total, there were 618 708 gene-disease co-citations, 
in which 16% (8297) of alt studied genes had been associated 
to a disease and 96% (3875) of all diseases had been associated 
to at least one gene. To rank the relative strengths of gene 
disease relationships, we tested several different statistical 
methods and examined the results. With the exception of the 
relative risk estimates, the methods provided similar results 
with respect to the rank order of the gene-disease association 
strengths. However, after comparing the results to other 
databases and after consulting disease experts, the log of the 
product of frequency (I.PF) was selected for further analysis 
because it gave the best results overall. 

Validation of MedGene. In developing this tool, it was 
important to minimize the number of missed genes (false 
negatives) and miscalled genes (false positives). However, in 
situations when these goals were in conflict, inclusiveness was 
prioritized. To determine the false negative rate in MedGene, 
breast cancer was used as a test case because it was associated 
with more genes than any other human disease and because 




Figure 1. Estimation of the false negative rate by comparison 
with hand-curated databases. The breast cancer-related genes 
identified by MedGene were compared with those listed in 
several other databases including the Tumor Gene 'Database 
(TGD),* the Breast Cancer Gene Database(BCG). 1 GeneCards 
(GC) 17 and Swissprot. 18 Genes were considered false negatives 
if they were represented in at least one of these other databases 
and not in MedGene and their link to breast cancer was sup- 
ported by at least one literature reference. All literature references 
were verified by manual review to confirm their validity. The 
number of genes in each database or shared by more than one 
database is indicated. The false negative rate was calculated by 
genes missed at MedGene (26)/lota! number of nonoverlapping 
genes in other databases (285). 

there were several public databases that link genes to breast 
cancer. We compared the list of breast cancer-related genes 
from MedGene to these databases, illustrated in Figure 1. 
Among the 285 distinct breast cancer-related genes that were 
supported by at least one literature citation in these hand- 
cu rated databases, 26 were absent from MedGene, suggesting 
a false negative rate of approximately 9%. To determine why 
these were missed, all literature references for these genes (80 
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papers) were reviewed manually (see the Supporting Informa- 
tion. Supplemental Table 2. or visit http://hipseq.med. 
harvard.edu/MedCene/publication/s_.Tablc 2.html). Among 
these papers, most false negatives were caused by nonstandard 
gene terms or gene terms eliminated by our specificity fitters. 
Few genes were missed because they were only mentioned in 
review papers (0.4%) or they appeared only in the body of the 
manuscript but not the abstract or title (1.1%). Of note, 
MedCene identified approximately 2000 additional breast 
cancer-related genes not listed in any other database. 

To assess the false positive error rate, two complementary 
approaches were used: a detailed analysis of one disease and 
a globaJ examination of 1000 diseases. The detailed approach 
examined the false positive error rate and its sources, whereas 
the global approach tested whether the overall results made 
biomedical sense. 

Using the LPF. 1467 genes related to prostate cancer were 
assembled in rank order. We then retrieved approximately 300 
Medline records each for the highest ranked 100 and the lowest 
ranked 200 genes and manually reviewed the titles and 
abstracts to determine the verily of the association. Nearly 80% 
of the highest ranked 100 genes fell into one of the five 
categories that reflect meaningful gene-disease relationships 
(see the Supporting Information. Supplemental Table 3, or visit 
http://htpseq.rned.harvard.edu/MedGene/publieation/ 
s_Table 3.html). Among the lowest ranked 200 genes, ap- 
proximately 70% reflected true relationships. Of the 600 records 
reviewed, there were only two in which the association between 
the gene and the disease was described as negative. Both were 
genes with very low scores. In both cases, the authors did not 
argue the absence of any relationship, but rather that a 
particular feature of the gene or protein was not shown to be 
related to human prostate cancer, 1X14 

The coincidence of some gene symbols with medical ab- 
breviations, chemical abbreviations and biological abbrevia- 
tions resulted in most of the false positives (see the Supporting 
Information, Supplemental Table 4, or visit http://hipse- 
cj.rned.harvard.edu/MedGene/publication/s_TabIe 4.html), em- 
phasizing the importance of the filters that were added in the 
search algorithm (Table 1). Without the filters, the false positive 
rate more than doubled, and the false negative rate rose 
dramatically (data not shown). For example, among the papers 
about breast cancer, there were only 12 Medline records that 
referred to ESRl and 10 to ESR2, whereas almost 2000 papers 
mentioned estrogen receptor without specifying ESRl or ESR'£ 
this latter group was detected by the family stem term filter. 

To further validate these results, a global analysis of the gene- 
disease relationships described by Med Gene was performed. 
For this experiment, it was reasoned that the more closely 
related the diseases are to one another, the more they will be 
related to the same gene sets. Thus, if the relationships defined 
by MedCene accurately reflected the literature, then an unsu- 
pervised hierarchical clustering of the gene data should group 
diseases in a manner consistent with common medical think- 
ing. Conversely, if the clustered diseases do not make sense 
biologically or medically, it may reflect excessive false positives, 
false negatives, or inappropriate scoring of the data. 

To execute this experiment, the gene sets and the corre- 
sponding LPF values for 1000 randomly selected diseases (each 
with at least 50 gene relationships) were used as a data set for 
clustering the diseases. A review of the results showed that the 
resulting disease clusters were indeed logical based upon 
common medical knowledge (see the Supporting Information, 
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Supplemental Figure 1 , or visit http://hipseq.med.harvard.edu/ 
MedCene/pub!ication/s_Figure l.ht.ml). For example, in one 
such cluster shown in Figure 2, diabetes and its complications 
grouped together and were also closely linked to diseases 
associated with starvation states. 

The number of genes associated with a given disease can 
be estimated by adjusting the MedCene number up by I he false 
negative rate (-9%) and down by the false positive rate (-26% 
on average). Using this, the average disease has 103.7 ± 45.3 
(mean ± s.d.) genes associated with it. although the range is 
quite broad with 2359 genes related to breast cancer, 2122 
genes related to lung cancer and no genes related to a number 
of diseases. 

Applying MedGene to the Analysis of Large Datasets. Access 
to a comprehensive summary of the genes linked to human 
diseases provided an opportunity to analyze data obtained from 
a high-throughput experiment. We compared the MedCene 
breast cancer gene list to a gene expression data set generated 
from a micro-array analysis comparing breast cancer and 
normal breast tissue samples. Micro- array analysis identified 
228G genes that had greater than a 1-fold difference in mean 
expression level between breast, cancer samples and normal 
breast samples. Using MedGene. we sorted the 2286 genes into 
four classes: 555 genes directly linked to breast cancer in the 
literature by gene term search (first-degree association by gene 
name); 328 genes directly finked by family term search (first- 
degree association by family term); 1021 genes linked to breast 
cancer only through other breast cancer genes (second-degree 
association); and 505 genes not previously associated with 
breast cancer. (See the Supporting Information, Supplemental 
Figure 2, or visit http://hipseq.med.harvard.edu/MedCene/ 
publication/s_Figure 2.html.) Among die 505 previously un- 
related genes, 467 were either newly identified genes or genes 
that had not previously been associated with any disease. 
Among the remaining 38 genes, 9 had been related to other 
cancers, specifically esophageal, colon, uterine, skin, and cervix. 

To determine whether the genes highlighted by the micro- 
array analysis were more likely to have been previously linked 
to breast cancer in die literature, we created a two-dimensional 
plot of the fold change' of expression level between breast 
cancer and normal tissue versus the literature score (LPF) 
(Figure 3A). There was a broad spread of expression changes 
among the genes directly linked to breast cancer ranging from 
less than 1-fold change "(68%) to over 40-fold (0,3%). Notably, 
the majority of genes with greater than 10- fold expression 
changes were linked to breast cancer by first -degree associa- 
tion. 

Among all 754 genes directly linked to breast cancer in the 
literature, there was no correlation between LPF and micro- 
array fold change (r — 0.018, /rvalue = 0.62). However, when 
we stratified the analysis based on the magnitude of the fold 
change, we observed an increasing trend in correlation (Figure 
3B) suggesting that, genes with a more substantial change in 
expression level were more likely to have a stronger association 
in the literature. For genes that had I0 r fold change or more in 
expression level, the correlation increased to 0.41 [p- value - 
0.05). 

When we evaluated the micro- array data separately for ER 
positive and FR negative tumors, the trend in correlation 
between fold change and literature score was highly dependent 
on estrogen receptor status. Interestingly, there was a similar 
trend in correlation for EH positive tumors, but no trend in 
correlation for i:K negative tumors. 
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Figure 2. Global validation by clustering analysis. 2(A). The gene sets and- the corresponding LPF values for 1000 diseases, each with 
at least 50 gene relationships, were used in an unsupervised clustering of the diseases based on the gene patterns associated with 
them. A sample of the data is shown here, 2{B). One of the resulting clusters is shown that corresponds to blood sugar states. Diabetes 
terms (above the line) and starvation states terms (under the line) clustered together.. Within these groups, there is also clustering of 
diabetic small vessel complications, altered serum chemistries, nutritional disorders, etc. (Supplemental Figure 1: http://hipseq.med, 
harvard.edu/MedGenc/publication/s_Figure 1 .html). 



Finally, to validate our findings, we computed similar cor- 
relations between the breast cancer expression data and 
LPF scores generated by Med Gene for hypertension, a . 



disease unrelated to breast cancer. As expected, we did not 
observe an increasing trend in correlation for hyperten- 
sion. 
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Figure 3. Relationship between literature score and functional data for breast cancer. 3A. The data from an expression analysis of 
samples for breast tumors and normal breast tissue were analyzed to indicate the fold difference of expression level between breast 
tumor and norma) sample {cutoff > 3-fold change). The fold changes were plotted against the literature score for the same gene set. 
Green dots represent first-degree association by gene search, blue dots represent first-degree association by family search and red 
dots represent no-association. Some well-studied genes, such as BRCA2 (pink circle), are not reflected by a substantial difference in 
expression level. Furthermore, the majority of genes that have no association with breast cancer in the literature had less than 10-fold 
expression changes (shaded area). 3B. The Spearman rank-correlation coefficients between literature score (LPF) ana the fold change 
of expression level between tumor and normal breast samples (y-axis) in relation to the amount of fold change of expression level 
(x-axis). Gene rank lists were generated for breast cancer (blue) and hypertension (pink). Correlations were also computed between 
the breast cancer gene LPF scores and fold change expression data among estrogen receptor positive tumors only (light blue) and 
estrogen receptor negative tumors only (purple). 
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breast neoplasms 



hypertension 



rheumatoid arthritis 



estrogen receptor 

PGR 

ERBB2 

BRCAi 

BRCA2 

EGFR 

CYP19 

TFFl 

PSEN2 

TP53 

CBS3 
CEACAM5 

ERBB3 

cyclin 

COX5A 

cathepsin 

ERBB4 

TRAM 

CCNOi 

EOF 

MVCi 

insulin-like 
BCL2 

mucin 
FGF3 



REN 
DBP 
IMP 
ACT 
INS 

kalllkrein 
ACE 

endotlielin 

S100AO 

BDK 

DIANFH 
SARI 

PJH 

CD59 

ALB 

CYP1IB2 
MAT2B 
angiotensin 
receptor 

ACTR2 

NPPA 

LVM 

PBH 
NPY 

POMC 

neuropeptide 



RA 

TNFRSF1 OA 
CRP 
AS 
ESR1 

HLA-DRB1. 
DRl 

imerleukin 

TNF 

1L6 

collagen 
ILIA 

ACR 

TNFRSF12 
IL2 

CHI3LI 
1LH 

imerleukin ! 
matrix 

metatloproteinn.se 

interferon 

CD68 

HA 
IU7 

MMP3 
SIL 



t 
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bipolar disorder 



ERDAl 

SNAP29 

PFKL 

DRD2 

TRH 

IMPA2 

HTR3A 

DR03 

REM 

KCNN3 

ORO-i 
UTR2C 

RELN 
DBH 
MAO A 
COMT 
HTR2A 

SYNJl 

INPP1 

NEDD4L 

FRA13C 

transducer of 

ERBB2 

BAIAP3 

ATPIB3 
0RD5 



atherosclerosis 



apolipoprotein 

APOE 

LDLR 

ELN 

ARC I 

APOB 

APOAl 

MSRl 

LPL 

PONl 

plasminogen 
activator inhibitor 
PLC 

vascular cell 

adhesion molecule 

ATOHJ 

VWF 

INS 

ARC2 

ABCAl 

OLRl 

collagen 

MCP 

lipoprotein 
AP0A2 
intercellular 
adhesion molecule 
RAB27A 



* MedCene results for die top 2i> genes associated with breast neoplasms, hypertension, rheumatoid arthritis, bipolar disorder, and atherosclerosis, respectively, 
ranked by LPK scores. The hyperlink to all the papers co-citing the gene and the disease is available at MedCene website (http://hipseq.med.harvard.edu/ 
MedGene/}. 



Discussion 

The Human Genome Project heralded a new era in biological 
research where the emphasis on understanding specific path- 
ways has expanded to global studies of genomic organization 
and biological systems. High-throughput technologies can 
provide novel insight into comprehensive biological function 
but also introduces new challenges. The utility of these 
technologies is limited to the ability to generate, analyze, and 
interpret large gene lists. MedCene, a relational database 
derived by mining the information in Medline, was created to 
address this need. MedCene users can query for a rank-ordered 
list of human gene-disease relationships {Table 1) for one or 
more diseases. Each entry is hyperlinked to the original papers 
supporting each association and to other relevant databases. 

MedCene is an innovative extension of previous text mining 
approaches. Perez-Iratxeta et al. used the CO annotation and 
their chromosomal locations to predict genes that may con- 
tribute to inherited disorders. 8 MedCene takes a broader view 
and includes all diseases and all possible gene-disease relation- 
ships. Furthermore. MedGene utilizes co-citation to indicate a 
relationship rather than GO annotation, which is limited to the 
subset of genes that have CO annotation. Our approach is 
complementary to that taken by Chaussabel and Sher, who 
used the frequency of co-cited terms to cluster genes into a 
hierarchy of gene-gene relationships. 6 

A unique aspect of this tool is the ability to assess the relative 
strengths of gene-disease relationships based on the frequency 
of both co-citation and single citation. This presupposes that 
most co-citations describe a positive association, often referred 
to as publication bias 15 and is supported by our observations 



that negative associations are rare (Supplemental Table 3: 
h tt p://hipseq. med.harvard.edu /MedCene/publica tion/s„Ta- 
ble 3.html). Of course, relationships established by frequency 
of co- citation do not necessarily represent a true biological link; 
however, it is strong evidence to support a true relationship. 

Another important feature of MedGene is the implements -' 
tion of software filters that substantially reduced the error rate. 
We estimate that less than 10% of all associations were missed 
and at least 70% of even the weakest associations were real. 
For this study, all of the filters that we applied were general 
ones, e.g., expanding the list of all gene names to address the 
different syntax forms used by different journals, eliminating 
gene names that correspond to common English words, etc, 
The majority of the remaining search term ambiguities were 
idiosyncratic and difficult to identify systematically without 
causing a significant rise in false negatives. Alternative ap- 
proaches, such as the examination of the nearest neighbor 
terms, need to be considered to further reduce the false positive 
rate. 

It is not uncommon to see expression changes in micro- 
array experiments as small as 2 -fold reported in the literature. 
Even when these expression changes are statistically significant, 
it is not always clear if they are biologically meaningful. When 
comparing expression levels of disease to normal tissue, one 
expects an enrichment of known disease-related genes to 
appear in the altered expression group. MedCene provided a 
unique opportunity (o test this notion in the context of existing 
knowledge on a novel breast cancer micro-array dataset. For 
genes displaying a 5- fold change or less in tumors compared 
to normal, there was no evidence of a correlation between 
altered gene expression and a known role in the disease. This 
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Table 3. Genes with Large Expression Changes in ER- but 
Not in ER-r Breast Tumors 



i$t:ne symbol fold change (ER+) fold change (EK-) 



KRTHBl 1-0 610.8 

BRS3 t.2 89.4 

DKKl 1.2 69,8 

ZlCl 1.9 59.6 

m?i . 1.0 38.5 

KIAA0U80 2.6 33.2 

CDKAtf 1.0 30.6 

EB12 4.0 27.9 

CZM£I 3.8 21.9 

4.7 18.6 

C7/>/?42 1.0 14.6 

MYOJ0 1.6 14.4 

-1.0 13.5 

4.2 13.0 

HMC4 14 12.9 

BCL2L11 -1.2 12-3 

t/?P* 2.9 12.2 

1.0 11.8 

CCMT2 4.0 11.6 

FCB -4.3 11.1 

2.9 10.9 

HIF5 3.0 10.2 

SERPINH2 4.6 10.2 

K4P/ 1-0 10,0 

LPHB -1.3 -10.4 

TCEAZ -1.1 -10.8 

7m 1.3 -H.4 

COL17AI -4.1 -15.7 

/W5 1.1. -16.2 

BPACt -4.6 -22.3 

/WfO -i.l ~36.8 

VECFC -2.8 -51,5 

MUC6 -1.4 -04,9 

SERP1NA5 -10 -83.1 

M£7S/ -1.6 ~85,9 ' 

OU* 2.4 -150.3 



Table 3. Med Gene identified a set of relatively understudied, yet highly 
.expressed genes in EK negative, but not ER positive breast tumors. All of 
these genes have either never been co-cited with breast cancer or have a 
weak association except those marked with an \ 



reflects the many genes whose role in breast cancer may not 
involve large changes in expression In sporadic tumors (e.g., 
BRCA1 and BRCAZ) and genes whose modest changes in 
expression may be unrelated to the disease. Strikingly, among 
genes with a 10-fold change or more in expression level, there 
was a strong and significant correlation between expression 
level and a published role in the disease, providing the first 
global validation of the micro-array approach to identifying 
disease-specific genes, 

The results derived from MedGene have two implicit lions. 
First, a careful hunt for corroborating evidence of a role in 
breast cancer should precede any further study of genes with 
less than 5 -fold expression level changes. Second, any genes 
with 10- fold changes or more are likely to be related to breast 
cancer and warrant attention. It is likely that this threshold will 
change depending on the disease as well as the experiment. 

Interestingly, the observed correlation was only found among 
ER-positlve tumors, not ER- negative. This may reflect a bias 
in the literature to study the more prevalent type of tumor in 
the population. Furthermore, this emphasizes that caution 
must be taken when interpreting experiments that may contain 
sub populations that behave very differently. The MedCene 
approach identified a set of relatively understudied, yet highly 
expressed genes in ER- negative tumors that are worthy of 
further examination (Table 3). 



In conclusion, we have developed an automated method of 
summarizing and organizing the vast biomedical literature. To 
our knowledge, the resulting database is the most comprehen- 
sive and accurate of its kind. By generating a score that reflects 
the strength of the association, it provides an important tool 
for the rapid and flexible analysis of large datasets from various 
high-throughput screening experiments. Furthermore, it can 
be used for selecting subsets of genes Tor functional studies, 
for building disease-specific arrays, for looking at genes com- 
mon to multiple diseases and various other high- throughput 
applications. In tiie future, it will be possible to enhance the 
utility of the MedGene database by building links between 
genes and other MeSH terms as well as other biological 
processes and concepts, such as cell division and responses to 
small molecules. 
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ABSTRACT The consistent cytogenetic translocation of 
chronic myelogenous leukemia (the Philadelphia chromosome, 
Ph 1 ) has been observed In cells of multiple hematopoietic 
lineages. This translocation creates a chimeric gene composed 
of breakpolnt-cluster-reglon (bcr) sequences from chromosome 
22 fused to a portion of tixcabl oncogene on chromosome 9. The 
resulting gene product (P2I0 C ** M ) resembles the transforming 
protein of the Abelson murine leukemia vims in its structure 
and tyrosine kinase activity. P210 c '* w Is expressed In Ph 1 - 
positive cell lines of myeloid lineage and In clinical specimens 
with myeloid predominance. We show here that Epstein-Barr 
virus-transformed B -lymphocyte lines that retain Ph 1 can 
express P210 c ' aM f The level of expression in these B-cell lines is 
generally lower and more variable than that observed for 
myeloid lines. Protein expression is not related to amplification 
of the abl gene but to variation in the level of bcr-abl mRNA 
produced from a single Ph 1 template. 



Chronic- myelogenous leukemia (CML) is a disease of the 
pluripotent stem cell (1). In greater than 95% of patients, the 
leukemic cells contain the cytogenetic marker known as the 
Philadelphia chromosome, or Ph 1 (2). This reciprocal 
translocation event between the long arms of chromosomes 
9 and 22 has been used as a disease-specific marker for 
diagnosis and evaluation of therapy. Multiple hematopoietic 
lineages, including myeloid and B-lymphoid, contain Ph 1 in 
early or chronic phase, as well as in the more acute accel- 
erated and blast crisis phases of the disease. 

One molecular consequence of Ph 1 is the translocation of 
the chromosomal arm containing the c-abl gene on chromo- 
some 9 into the middle of the breakpoint-cluster region (bcr) 
gene on chromosome 22 (3-6).' Although the precise 
translocation breakpoints are variable, an RNA-splicing 
mechanism generates a very similar 8-kilobase (kb) mRN A in 
each case (5-9). The hybrid bcr-abl message encodes a 
structurally altered form of the abl oncogene product, called 
P210 c ** bl (10-13), with an ammo-terminal segment derived 
from a portion of the exons of bcr on chromosome 22 and a 
carboxyl-terrninal segment derived from a major portion of 
the exons of the c-abl gene on chromosome 9. The chimeric 
structure of bcr-abl and the resulting P210 c '* bI is similar to the 
structure of the Abelson murine leukemia virus gag-abl 
genome and resulting P160 v " abt transforming gene product. 
Both proteins have very similar tyrosine kinase activities (10, 
11, 14) which can be distinguished by their relative stability 
to denaturing detergents and by their ATP requirements from 
the recently described tyrosine kinase activity of the c-abl 
gene product (15). 



The publication costs of this article were defrayed in part by paye charge 
payment. This article must therefore be hereby marked "advertisement" 
in 'accordance with 18 U.S.C J 1734- solely to indicate ihis fact. 



In concert with structural modification of the ami no- 
terminal portion of the abl gene, increased level of expression 
has been implicated in activation of c-abl oncogenic poten- 
tial. Myeloid and erythroid cell lines and clinical samples 
derived from acute-phase CML patients contain about 10- 
fold higher levels of the 8-kb bcr-abl mRN A and P210 c, * bi than 
the c-abl mRNA forms (6 and 7 kb) and Pl45 c *» bJ gene product 
(5, 8, 9, 11). The higher level of expression of the chimeric 
bcr-abl message in acute-phase cells is not likely to be solely 
due to the presence of the bcr promoter sequences at the 5' 
end of the gene, since the normal 4,5-kb and 6.7-kb bcr 
encoded mRNA species are expressed at an even lower level 
than the normal c-abl messages (5, 6). 

We have analyzed a series. of Epstein-Barr virus-immor- 
talized B-lymphoid cell lines derived from CML patients (16). 
With such in vitro clonal cell lines, we can evaluate whether 
the presence of Ph 1 always results in synthesis of the chimeric 
bcr-abl message and protein, and whether the quantitative 
expression varies for cells of B-lymphoid lineage as com- 
pared to previously examined myeloid cell lines. Our results 
show that cell lines that retain Ph' do express bcr-abl message 
and protein, but that the level is generally lower and more 
variable than previously seen for myeloid' cell lines. The 
demonstration that the Ph 1 chromosomal template can vary 
in its level of expression of P2]0 c ' abI suggests that secondary 
mechanisms, beyond the translocation itself, contribute to 
the regulation of the bcr-qbl gene in different cell types or 
subclones that derive from the affected stem cell. 

MATERIALS AND METHODS 

Cells and Cell Labelings, Epstein-Barr virus -transformed 
B-lymphoid cell lines were established from peripheral blood 
samples of chronic- and acute-phase CML patients as report- 
ed (16). The cell lines are designated according to patient 
number, karyotype, and lineage. For example, SK- 
CML7Bt(9,22)-33 refers to CML patient 7, B-lymphoid cell' 
line, 9;22 translocation (Ph 1 ), cell line 33; and SK-CML7BN- 
2 refers to B-cell line 2 with a normal karyotype derived from 
the same patient. Repeat karyotype analysis was performed 
to verify the retention of Ph 1 just prior to analysis for abl 
protein and RNA. Cells were maintained in RPMI 1640 
medium with 20% fetal bovine serum, We have not observed 
any consistent pattern of in vitro growth rate that correlates 
to the stage of disease at the time of transformation with 
Epstein-Barr virus. Cells (1.5 x 10 7 ) were washed twice with 
Dulbecco's modified Eagle's medium lacking phosphate and 
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Table 1. Relative levels of bcr-abl expression in Epstein-Barr 
vjrus-immortalizcd B-ceU lines and myeloid CML lines 



Cell line* 


CML phase 1 


Ph 1 * 


P2101 


mRNA* 


SK-CML7BN-2 


BC 








SK-CML8BN-10 


Chronic 








SK-CML8BN-12 


Chronic 








SK-CMU6BN-1 


Chronic 








SK-CMU5BN-1 


Chronic 








SK-CML7B5-33 


BC 


+ 


+ + + 


4-4-4- 


SK-CML21BM 


Acc 




+ + 4" 


+ + + 


SK-CML21Bt-$ 


Acc 


+ 


+ + + 


+ + + 


SK-CML8Bt-3 


Chronic 


+ 




± 


SK-CML16BM 


Chronic 


+ 


-r 


•f 


SK-CML35BI-2 


Chronic 






4- 


K562 


BC 


+ 


+ + + + + 


4-4-4- + 4- 


BV173 


BC 


+ 


+ + 4- + + 


+ 4- 4- + + 


EM2 


BC 




■+ + + + + . 


+ +4-4- + 



*Cclt lines derived from CML patients by transformation with 
Epstein-Barr virus as described (16). Names of cell lines indicate 
patient number and Ph 1 status: SK-CML7BI indicates a cell line 
derived from patient 7 that carries the 9;22 Ph 1 translocation; N 
indicates a normaj karyotype. Myeloid-erythroid cell lines (K562, 
EM2, and BV173) axe de-scribed in previous publications (9, 11, 22, 
33). 

Status of patient at the time cell line was derived. BC, blast crisis; 
Acc, accelerated phase. 

^Presence (4-) or absence {-) of Ph 1 as demonstrated by karyotypic 
or Southern blot analysis. 

5p210 c "* bl detected as described in legend to Fig. 1. B-ccll lines 
derived from blast-crisis and accelerated-phase patients had levels 
of P210 3- to 5-fold higher (4- + 4-) than levels of P145. Chronic- 
phase-derived cell lines had P210 levels lower than orjust equivalent 
(+) to the level of P145. Myeloid and erythroid lines had levels of 
P210 5- to 10-fold higher than P145 (4- + + + 4-). 
^Eight-kilobasc bcr-abl mRNA detected as described in legend to 
Fig. 2. Symbols: :t, borderline detectable; + 4-4- + + , level of 8-kb 
mRNA 5- to 10-fold higher than that of the 6- and 7-kb c-cW mRNA 
species; + + 4 , level of 8-kb mRNA 3- to 5-fold higher than that of 
the 6- and 7-kb species; 4- , a level approximately equivalent to that 
of the 6- and 7-kb messages. 

data not shown). There was no difference in the copy number 
of a6/-reiated sequences as judged by Southern blot analysis 
(Fig. 4). Only the K562 cell line control showed an amplifi- 
cation of abl sequences, as previously reported (22, 23). 
These combined data suggest that differential bcr-abl mRNA 
expression from a single gene template is responsible for the 
variable levels of P210 CHlbt detected. This could be mediated 




P210 



P145 



Fig. 2. Analysis of stead y.state abl protein levels by immuno- 
blotting. Cell extracts prepared from 2 x 10* cells of lines SK- 
CML7BN-2 (A,-), SK-CML7Bt-33 (A, 4-), SK-CML8BN-10 (B,-), 
and SK-CML8Bt-3 (2f, + ) were, concentrated by immunoprerip- 
itation with anti-pEX-2 plus anti-pEX-5, Samples were then electro- 
phoresed in a NaDodSO-i/8% polyacrylamide gel and transferred to 
nitrocellulose, using protease-facilitated transfer (18), abl proteins 
were detected using a mixture of two monoclonal antibodies directed 
against the pEX*2 and pEX-5 aW-protein fragments produced in 
bacteria (19) as a probe and a peroxidase-conjugated goat anti-mouse 
second-stage antibody (Bio-Rud) for development. 



A 

- +■ 

LI'-? 



kb 



- 7 
-6 



Fig. 3. Comparison of abl RNA levels in Ph l -positive and 
-negative B-cell lines. The levels of the normal 6- and 7-kb Q-abl 
RNAs and the S-kb bcr-abl RNA were analyzed by blot hybridization 
using a v-abl probe. RNA was extracted from Ph^negative lines 
SK-CML7HN-2 {A,-) and SK-CML16BN-1 (/?.-), from Prepos- 
itive lines SK-CML6Bt-33 (A,+) and SK-CML16Bt-3 (5,4-), and 
from line K562 (C, + ) by the NaDodSO,/urea/phenol method (20). 
Polyadenylylated RNA was purified by oligo(dT) affinity chroma- 
tography, and 15 jig of each sample was clectrophoresed in a 1% 
agarose/formaldehydc gel and then transferred to nitrocellulose. The 
blotted RNAs were hybridized with a nick-translated v-abl fragment 
probe (21) and then autoradiographed for 4 days. 



by factors influencing the transcription rate of the bcr-abl 
gene or the stability of the mRNA. 

DISCUSSION 

Se veral lines of evidence suggest that formation of Ph 1 is noi 
- the primary event that affects the stem cell in CML. Patients 
have been identified that present with the clinical picture of 
CML but only later develop Ph 1 (1), This observation, 
coupled with studies of G6PD (glucose-6-phosphate dehy- 
drogenase)-heterozygous females with CML that demon- 
strate stem-cell clonality by isozyme analysis among, cell 




Fjg. 4. Southern blot analysis of abl sequences in Pr^-positive 
and -negative B-cell lines. High molecular weight DNA (15 Kg) was 
digested with restriction endonuclease BamlU, separated in a 0,8% 
agarose gel, and then transferred to nitrocellulose. The blotted DNA 
. fragments were hybridized with a nick-translated, 2.4-kb B$l 11 v-abl 
fragment (1.5 x 10* cpm/^g; ref. 21) and exposed for 4 days. (A) 
Autoradiogram of o6/-specific fragments in cell lines HL-60(lane 1), 
EM 2 (lane 2), K562 (lane 3), SK-CML7BI-33 (lane 4), SK-CML8BI-3 
(lane 5), SK-CML16BM (lane 6), SK-CML21BI-6 (lane 7), SK- 
CML35BI-2 (lane 8), SK-CML7BN-2 (lane 9), SK-CML8BN-2 (lane 
10), and SK-CML35BN-1 (lane 1 1). W) Ethidium bromide staining of 
agarose gel prior to transfer to nitrocellulose, showing the level of 
variation in amount of DNA loaded per lane. 
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populations that lack the Ph 1 marker, supports a secondary 
or complementary role for Ph' 1 in the progression of the 
disease (24, 25), This chromosome marker is found in 
chronic, accelerated, and blast-crisis phases of the disease. It 
is likely that Ph 1 confers some growth advantage, since cells 
^iiith the marker chromosome eventually predominate the 
~lmarrow and peripheral blood even in chronic phase. During 
She phase of blast crisis, many patients develop additional 
'Chromosome abnormalities, including duplication of Pb\ a 
^variety of trisomies, and complex translocations (26). This 
is suggestive evidence for Ph 1 being a necessary but not 
sufficient genetic change for the full evolution of the 
disease. 

The realization that one molecular result of Ph 1 is the 
generation of a chimeric bcr-abl protein with functional 
characteristics and structure analogous to the gag-abl trans- 
forming protein of the Abel son murine leukemia virus 
strengthens the argument for an important role of Ph 1 in the 
pathogenesis of CML. Although the Abelson virus is gener- 
ally considered a rapidly transforming retrovirus, its effects 
can range from overcoming growth factor requirements, to 
cellular lethality, to induction of highly oncogenic tumors in 
a number of hematopoietic cell lineages (27, 28). Even in the 
transformation of murine cell targets, there are several lines 
of evidence that suggest that the growth-promoting activity of 
the \'-abi gene product is complemented by further cellular 
changes in the production of the malignant-cell phenotype 
(29-31), 

The regulation of bcr-abl gene expression is complex 
because the 5' end of the gene is derived from the non-abl 
sequences, bcr, normally found on chromosome 22 (6). The 
level of stable message for the normal bcr gene and the 
normal abi gene are both much lower than the level of the 
bcr-abl message and protein from cell lines and clinical 
specimens derived from myeloid blast-crisis patients (5, 6, 
11), Therefore, the high level of bcr-abl expression cannot 
simply be attributed to the regulatory sequences associated 
with bcr. Possibly, creation of the chimeric gene disrupts the 
normal regulatory sequences and results in a higher level of 
expression. Variation in bcr-abl expression may result from 
secondary changes in the structure of the chimeric gene or 
function of rran j-acting factors that occur during evolution of 
the disease. Our analysis of P210 c '* bl and the 8-kb mRNA in 
Epstein-Barr virus-transformed iV-positive B-cell lines 
demonstrates that stable message and protein levels from the 
bcr-abl gene can vary over a wide range. This variation does 
not result from a change in the number of bcr-abl templates 
secondary to gene amplification but more likely from changes 
in either transcription rate or mRNA stability. We suspect 
this range of bcr-abl expression is not limited to lymphoid 
cells. Analysis of peripheral blood leukocytes derived from 
an unusual CML patient who has been in chronic phase with 
myeloid predominance for 16 years showed a level of 
P210 c ~ abl one-fifth that of P145 c ' abl , as detected by metabolic 
labeling with ["Pjorthophosphate and immunoprecipitation 
(S.C., O.N.W., and P. Greenberg, unpublished observa- 
tions). Lower levels of expression of the chimeric mRNA 
have been demonstrated in clinical samples from chronic- 
phase CML patients compared to acute-phase CML patients 
(9). Others have reported chronic-phase patients with vari- 
able but, in some cases, relatively high levels of the bcr-abl 
mRNA (32). The sampling variation and the heterogenous 
mixture of cell types in clinical samples complicate such 
analyses. Further work is needed to evaluate whether there 
is a defined change in P210 c ' abl expression during the pro- 
gression of CML. It is interesting to note that among the 
limited sample of Ph^positive B-cc\\ lines we have examined 
(Table 1), we have seen higher levels of P210 c ' Bbl in those 
derived from pativ i at more advanced stages of the disease. 



It will be important to search for cell-type-specific mecha- 
nisms that might regulate expression of bcr-abl from Ph 1 . 
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Although the mature neutrophil is ono of 
the hotter characterized mammalian cell 
types, the mechanisms of myeloid differ- 
entiation are incompletely understood at 
the molecular level. A mouse promyelo- 
cyte ceil line (MPRO), derived from mu- 
rine bone marrow cells and arrested devel- 
opmentally by a dominant-negative 
retinotc acid receptor, morphologically 
differentiates to mature neutrophils in the 
presence of 10 jtlVI retinoic acid. An exten- 



sive catalog was prepared of the gene 
expression changes that occur during 
morphologic maturation. To do this, 3 - 
ond differential display, oligonucleotide 
r.hfp array hybridization, and 2-dimen- 
siona) protein electrophoresis were used, 
A large number of genes whose mRNA 
levels are modulated during differentia- 
tion of MPRO cells were identified. The 
results suggest the involvement of sev- 
eral transcription regulatory factors not 



previously implicated in this process, hut 
they also emphasize the importance of 
events other than the production of new 
transcription factors. Furthermore, gene 
expression patterns were compared at 
the level of mRNA and protein, and the 
correlation between 2 parameters was 
studied. (Blood, 2001;90:513-524) 
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Introduction 



Studies of normal myeloid maturation from many laboratories have 
identified genes that may play critical roles in mydoi^difft^entm- 
tion. M Current studies suggest that these events arc dependent on a 
cascade of molecular changes that involve complex modulation of 
mRNA transcription. Furthermore, studies of acute leukemia have 
suggested that the disease arises from the accumulation of myeloid 
precursors arresied at early stages of differentiation and associated, 
in many cases, with chromosomal rearrangements that alter the 
structure of specific transcription factors. 5 Nevertheless, the molecu- 
lar events underlying the production of mature myeloid cells arc 
not well understood and appear to use interacting pathways and 
networks, the elucidation of which requires an extensive descrip- 
tion of the molecular components available to the myeloid cell. 

An extensive body of information is accumulating with respect 
to gene expression profiles of mammalian cells. However, much of 
the information available in public databases has been accumulated 
by the use of techniques such as single oligonucleotide chips or 
cDNA arrays that measure fewer than 6000 of potentially 30 000 to 
1 20 000 transcripts. ITie more limited range of analyses reported by 
the serial analysis of gone expression (SAGH) 6,7 technique accu- 
rately estimates changes in levels of the more abundant mRNAs but 
requires extensive redundant analyses to measure changes in the 
patterns of expression of scarce mRNAs. We have used a modified 
polymerase chain reaction (PCR)-based cDNA differential display 
(DD) method in which single restriction fragments derived from 
the V end of cDNAs are separated on a sequencing gel.*- 1 ' Bands 
from the gel can be identified initially by sequencing, but thftn 



comparison of patterns from different samples can be made without 
further sequencing. This sensitive and reproducible method detects, 
"In principle, mostTDNAFii^aJclless of ^lmm*ei^w~are re pro- " 
sented in existing databases. 

Systematic analysis of the function of genes can also be 
performed at the protein level. This approach has the advantage of 
being closest to function, because proteins perform most of the 
reactions necessary for the cell. The mo.it common method of 
proteome analysis is the combination of 2-riimensional gel electro- 
phoresis (2DE) to separate and visualize protein and mass spectrom- 
etry (MS) for protein identification. 10 Several such analyses of 
yeast and of normal or malignant mammalian cells have been 
performed. To date, however, there have been few studies in which 
both mRNA and protein have been compared by applying analyses 
lo the same samples. The studies of Anderson" and Gygi 12 showed 
that there is not a good correlation between mRNA and protein 
levels, in yeast or human liver cells. However, other analyses 
disagree with this conclusion (Grecnbauin et al, manuscript 
submitted, and Fuioher et al 14 ). Furthermore, global correlations 
between changes in mRNA and protein levels have not been 
examined during the execution of any developmental program. 

The MPRO cell line was derived by transduction of a dominant- 
negative retinoic acid receptor construct into normal mouse bone 
marrow cells. It is a granulocyte-macrophage colony-stimulating 
factor (GM-CSF)-dcpendwu line arresUid at a promyelocyte stage 
of development.' 3,16 After treatment with alU/ron.v retinoic acid 
(ATKA) most of the cells acquire the morphology of mature 
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neutrophils and bug in to produce neutrophil lactofcrrin and gclati- 
nase, 2 proteins characteristic of neutrophil secondary granules.' 7 
As such, it offers a valuable model for studying neutrophil 
differentiation in vitro. 

We now report the analysis of mRNA expression changes 
during the process of MPRO cell maturation to neutrophils and 
compare the results with a limited analysis of cellular protein 
composition. inRNA expression changes were studied by combin- 
ing the use of oligonucleotide arrays and DD. A database (dbMC) 
with comprehensive genomic information for myeloid differentia- 
tion program was constructed (accessible at http://www.bioinfo.mbb. 
yale.edu/expressioii/neutrophil). We have grouped the changes in 
mRNA levels of a large number of genes into 6 patterns, with 
implications for the genetic program of myeloid differentiation. 

We also compared 2 -dimensional high- resolution gel electro- 
phoretograms from control cells and cells differentiated for 72 
hours in the presence of ATRA. Fifty protein spots whose relative 
intensity changed prominently during differentiation were exam- 
ined by mass spectrometry. The results suggest a poor correlation 
between mRNA expression and protein abundance, indicating 
that it may be difficult to extrapolate directly from individual 
mRNA changes to corresponding ones in protein levels (as 
" esTim aTorTTr 67n 71 7K)~ ™ _ 



Materials and methods 

Cell lines 

MPRO cells and HM-5 cells provided by Or Schick wan n Tsai (Prcri 
Hutchinson Cancer Research Center, Seattle, WA) ,S were, used throughout 
the study. The ceils proliferated continuously as a GM-CSF-de pendent cell 
line at ,17°C ill Iscoves modified Dulbeeco medium (Gibco BRL, Grand 
Island, NY) supplemented witli 5% to 10% fetal calf serum (Oibco BRL) 
and 10% HM~5~conditioned medium as n source of GM-CSF. Morphologic 
differentiation of the blocked MPRO promyelocytes was induced by 
treatment with 10 ilM ATKA (Sigma, St Louis, MO). Controls were 
cultured in the absence of ATRA but with the same volume of ve- 
hicle (ethanol). 

RNA isolation and differential display 

After exposure to 10 u,M ATRA tor 0, 24 1 48, oi 72 hours, total cellular 
RNA was isolated from MPRO cells using TKIzol reagent (Lite Technolo- 
gies, Gaithersburg, MD). cDNA whs then synthesized using a T-7 Sal-Oligo 
d(T) .12 primer as described previously. K,,K The double- stranded cDNA was 
digested with I of 0 different restriction 'enzymes {Aftal, BgfXl, BumHl, 
■EayX EcaR\ t WfndMI. Xhul, K(>n\, and Sph\) and ligatfd to Y-shaped 
adaptors with a complementary overhang. DNA fragments were then 
amplified by PCR as described previously. h ' IH PCR products were separated 
on h sequencing gel of 6% polyacryliunide with 7 M urea. The gel was dried 
and exposed to x-ray film. Genes from differential display gels, whose 
maximum intensity changes equaled 2f on a scale of I f to K + , were 
recorded as significantly change d. ig Individual DNA bands were recovered 
from the gels, amplified by PCR, and sequenced. 

Oligonucleotide chip analysis of RNA samples 

Ten micrograms total RNA from each sample (0, 24, 48, or 77. hours) was 
used to prepare cDNA. This cDNA was transcribed with T7 RNA 
polymerase to prepajc a fluorescent I y labeled probe. 211,2 1 F.iach sample was 
hybridized to mouse array chip (MuMK. Array; Aflymetrix, Santa Clam. 
CA) containing oligonucleotide probe sets corresponding to approximately 
7000 known genes or ESTs represented by UniGene clusters." cONAs 
were considered present if their probe set results were rated as such by t lie 
GeneChip software {Aflymetrix) and if the overage difference (AO) 
between perfect match and mismatch probe pairs was not less 100 U. If a 



gene was lepresenled by more than om: array probe set. the average of all 
probe sets foi the gene was taken. Genes with AD values between 100 and 
200 were considered unchanged because of fheir low expression levels. 
Those genes with AO values equal to or more than ?,00 U at one lime point 
were fuitlier studied by reseating, threshold, and normal nation methods 
described in the MIT Center for Genome Research Web site, 11 A value of 20 
was assigned to any gene with an AD below 20 af some time point. 

Bioinformatics and database development 

All the sequences or gene fragments were searched using Blast against 
Geo Bank and TIGR gem: indices. A database of genes or ESTs whose 
expression levels changed during myeloid differentiation was constructed 
containing information for each band or gene. This included GenBaux 
matches, Locus Link or Umgenc clusters, expression patterns, tissue 
distribution, synonym(s) protein name, gene najne(s), notations of possible 
functions, poiy A signal and sequence quality, and hyperlinks to the 
database searches, sequence trace files, and related references. AH gene data 
were then gathered into o cluster file. Supplementary Information is 
available at http //hioinfo.rnbb.yale.edti/eApa- | *sion/neutrophil 

Classification and analysis of DNA fragments 

Sequenc es from differen tial display analyses wer e classified »s_re Pjejtti\tin£_ 



known genes, ESTs, genomic sequences, or novel genes as described. 19 - 2 
Known genes from both differential display and arrays were clustered into 
27 functional categories and seat died against SAVI5S-PR.OT (http:// 
www.expasy.cbr.urc.ca/cgi-bjii/spiOt-seaich-ful) or PIR (httpV/www.pir. 
georgctown.edu/). In formation such as function, subcellular location, 
family and stipei family classification, map position, similarity, syiionym(s) 
protein name, gene nauie(s), and so on w,is lecorded in a variety 
of da (aliases. 



Northern blot analysis 

Thirty micrograms total cellular UNA per lane from time -course MPRO 
cells were loaded onto 1.2% fonnaldehyde-agitrose gels, then transferred to 
Kyboud-N + membranes (Amersharn Pharmacia Biotech, Uppsala, Swe- 
den). After standard pre hybridization, membranes were hybridized over- 
night at 65 P C with radiolabeled cDNA probes (ordered from Research 
Genetics according lo their dbEST Image ID). Membranes were washed at a 
final stringency of 60°C in 0.1 x SSC 

Immobilized pH gradient 2-dimensional rie( electrophoresis 
and mass spectrometry 

Induced MPRO cells collected at I) and 72 hours were lysed with lysis 
buffer (540 mg urea, 20 mg dithiothjeitnl, 20 pL Phaminlytc 1 3-101, 1.4 mg 
phenylmethylsulfonyl fluoride, 1 |tg each Hprotiuin, leupeptin, pepstatin A, 
and autipain 50 u-gfLCK, and 100 p.g TPCK/1 mL). We applied 100 f tL 
each MPRO cell lysate (2.5 x I0 f, celIs/IO0 u.L) to immobilized pH 
gradient (IPG) strips (pH 3-10 L; Amershmn Pharmacia Biotech), and IPG 
electrophoresis was conducted for 16 hours (20 100 Vh) using an Jmmobi- 
linc Drystrip Kit (Amersharn Pharmacia Biotech). Electrophoresis in the 
second dimension was carried out in a 12% sodium dodecyl sttlfate- 
polyacrylami.de g«I electrophoresis (SOS-PAGE) gel with the Laemmli- 
.SPS continuous system in a Protean II ,\i 7.-D cell (i'lio-Rad) run at 40 mA 
constant current for 4.5 hours. Proteins were detected by Brilliant Blue 
G-cotloidal staining. ^ Protein spots were excised from the gcj and digested 
with trypsin. ACTU clip (aveiage [M + HJ 2466.70) and bradykinin 
(average (MM TI| 106T.2H) were used for caltbiation nf peptide masses. One 
microliter sample digest was mixed with 1.0 p.L ir-eyano-4-hydroxy 
eiunnmicacid (4.5mg/mL iri 50% CH 3 CN, 0.05% Tl-'A) matrix solutiou and 
1 pL calibrauts (100 IVnol) each. The spectra of the. peptides were acq u tied 
in reflector/delayed extraction mode on a Voyager- OH STK mass spectrom- 
eter (Perseptive Biosysfems, Poster City, CA). Peptides were identified 
using the Propound search engine.^ 
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Results 

Dlffnrcntiation of MPRO cells 

Figure I illustrates tlie morphologic changes in an MPRO cell 
population representative of those used Tot RNA expression 
analysis. Undifferentiated MPRO cells resembled promyelocytes 
under the light microscope (Figure 1 A). After induction with ATRA 
for 24 hours, the cells morphologically differentiated into metamy- 
elocytes (Figure IB). At 48 hours, the cells further developed 
into metamyelocytes and band neutrophils (figure 1C). At 7? 
hours, nearly 100% of MPRO cells became mature neutrophils 
(Figure ID). 

Identification of mRNAs by differential display assay 

MPRO cellular mRNA was analyzed at 0, 24, 48, and 72 hours after 
ATRA treatment. Nine restriction enzymes were used in a 3 '-end 
DD approach. During MPRO differentiation, 1109 fragments 
corresponding to 837 transcripts were found to change substan- 
tially in expression levels (Figure 2). These represented approxi- 
mately 279 known genes, 112 IZSTs, and 59 putative new genes, 
■ -ea ch- wrtlr*-p er f bet -o r~fa if poj yadenyiat i on-si gna Hi t-an-np p ropriate- 
di stance from the oligo-dT priming site. The gene, information 
detected by DD was collected in database dbMCd. 

Identification of mRNAs by oligonucleotide chip ass:iy 

We used an oligonucleotide chip containing 13 179 probe sets 
corresponding to approximately 7000 murine- genes to analyze 
patterns of mRNA expression in the same RNA samples used for 
DD. The information obtained by oligonucleotide arrays was 
collected in the database dbMCa. 

We clustered the genes by their similarity to idealized 
expression patterns. For instance, the expression pattern of an 
ideal gene that is overex pressed (high) at time 0 and underex- 
pressed (low) at 24, 48, and 72 hours, would be high-low-low- 
low (HLLL). Overall we have (2 4 -2) idealized patterns exclud- 
ing HHHH and LIXL. Pearson correlation was used as the 




Figure 1. Morphology of MPRO cells during differentiation. MPRO r-clls were 
induced as doaaibtid in 'Materials and methods." concenlmtud by cytospin, ami 
Wrighl-Giemsa stained, (A) Un induced MPRO cells. (B) MPRO cells tnducud with 
ATRA for 24 Hours. (C) MPRO cuPs Induced with ATRA for 48 hours. (0) MPRO cells 
inclunRdwKhATRAfor72 hums. 




Ftguro 2. Distribution of p,enos obtained by DD assay. MPRO cell mRNA was 
- analyzed ai A-21,_4fU.ajKJ-22Jicwr^2fcer_AIJlk_ir^ 
corresponding to 837 trmwr/ipts wore found to change substantially In e^prussion 
lKV»rfs. The tola! B37 transcripts wore dassifiod info b' categont's aixordinp to trie 
hk>ln forma tic analysis. PcrcuntagRS show the gone distributions in these 6 calugo- 
fies. Information for e:ich transr.ripi was col loan d in iJaUioase dbMCti. 

measure of similarity of each Rene expression pattern, 
x — (xi,x 2 ,\-\,x A ) to each of the 14 idealized patterns 
y - (ViiVyj'.ij'O- The 4 entries of x and y corresponded to the 
'1 -dimensional gene expression levels at 0, 24, 48. and 72 hours, 
respectively. Each gene was assigned to a cluster labeled by the 
idealized pattern that had the maximal correlation with that 
gene. We selected only genes that hybridized well compared 
with the background (considered "present" by GcncChip soft- 
ware) and had maximal AD amplitude greater than 200 U in at 
least I of the 4 stages. We further tabulated the 14- patterns 
according to whether the gene expression changed at early 
(0-hour), intermediate (24- and 48-hour), and late (72-hour) 
tune points and whether gene expression inouotontcally in- 
creased (up-regulated), monotonically decreased (down-regu- 
lated), or was not monotonia (transient). Table 1 shows 8 
clusters of 104 genes that had significant changes of mRNA 
levels, arranged according to the temporal stage and the 
monotonic/transient changes of expression levels 

Principal component analysis det em lined whether we could 
comprehensively present multidimensional data (4-dimensional in 
our case) in a simple 2 -dimensional giaph. First, we found the 4 
principal components, which were the axes of the most compact 
4-dimensional ellipsoid that encompassed the 4-dimcns*tonal cloud 
of data. Bach axis was a different linear combination of the original 
4 variables. Then we verified that the first 2 principal components 
(the first 2 largest axes of the ellipsoid) captured most (95.2%) of 
the variation of the data. Therefore, the (lata could he faithfully 
projected (with a minor loss of information) into a 2-dimensional 
graph, with the 2 largest principal components as the x- and y-axes. 
As shown in Figure 3, genes tend to coalesce in clusters, according 
to their labels determined by their similarity to an ideal expression 
pattern. In summary, a genomic (global) picture of the distribution 
of genes nccording to theii similarity to predetermined idealized 
multidimensional expression patterns is concisely displayed in a 
2-dimensional graph. 
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Table 1. Gwnes differently regulated during th» different stages d! mutisK promyelocyte c«ll linn differentiation pruce.ss 



Timing 



Category 



Early 



Up-rugu Inl ion 


LHHH (n ~ U)) 


LLHH (n = fi) 


LLl.H (it - 13) 




AAndP2rx1 it{jb2 111 fl Lcn2 Hptf 


F'irviCybb Pfc Pirn 5 Cc53 »'0f/r.7 


mo Cxfli li a?ysioon8 l.-CCR Ctss 




Ct-bpb H2-D Bohifl Zy* 




Aldol R.*c? Kprl Ctsrf Ubb Plmh4 


Down-r emulation 


HUL (n = 11) 


HHLL(n - 1) 


HHHL (n - 37) 




Tcig-V4 lyf>4 Clag Spi2-1 McpW 


Mpn 


Acl> frf2EL2 RpMS Antb Ly6« AM J Hist2 




My^Myb |tr4 Nprnl lirh HspBO 




P>;ma^ Gn;is : ;t>r Hhftigl 
Max RpsflCr^iblSlpi laexl Tpi utt'J 
Cmt Cys3 SlelOa 1 Ctsb Seppl Rtn3 
Ccnb2 S 1 (X>a9 C(1 t HiM. r >-2ax Reta 
Copa Gsiini Gnb2-r si Grn RPL8 


Transient 




LLHL (n - 9) 

Sell K7.'2Pira6 Pirb Lstt Ltf Sema4d Starri Mmp9 
LHHL (n = 17) 

C&lipn Lyis Fcgr3 Artb Lampl Stat3 Csf^ra Osi 
Actg Stpit Gpx3 Ptprc Prtn3 Irf1 RpsBka t 
Ub4r Mytn 





Arrays ot Asymetrix Mullk containing 13103 probe sets corresponding to 12002 GenBank accessions wbih used for hybridization. Arrays were hybridized with 
strBpla\'icltn-phycoeryUnin (Molecular Probes) bbtin-talielud RNAand scanned. Intensity (or each featufu of tho array was captured using Genechip software (AffyinKirix). and 
a single raw expression level for each gene was d wived from the 20 probe pulis representing each gone using » trim med mean algorithm. For eachnene, an AD of 24-, 4 B>, and 
72-hour samplns was cafibrated by dividing the slope of the linear regression fine lor a graph with the x -axis tho AD of 0-hour probe sets and the y-axis the AD of the respective 
-4«ne-potr«-(24.-48rOf-7a.hnure )-Athrcshold.af 20JJ.v*as.aAsjgnad.l£i aitf.gene.'tf ith a j^lculnttid.LVUarcssjojil^^ because discriminat ion of ex prftjtsjun below this level 

uould not be performed with confidence. 3 * Each genu expression profile was categorized as described in Tables 3. 4, and b. f-cr the 4 time points, the minimum AO of the 
relatively higher group (MIN-H) was divided by the maximum AD of tf»e relatively low group (MAX-l.). and those genus whose MIN-H/MAX-L. greater than 2 were selected as 
meaningfully regulated. Genes wr;ro soitwd In descending ordor based on the MIN-HyMAX-L. Genes in boldface nre those whose etc press ion level was in the lop 20% (ic, 
maximum AO of 4 time points greater than 3000), and genes in italics am those in the bottom 20% ['m, maximum AD ot * time points ler.r. than 300). 1 he differentiation period 
was grouped into 3 stages; early (0-hour), middle (24-hour and 48-hour), end late (72-bour) stages. 

AD indicates average difference; gene symlxifs aie expanded in an Appendix at the end of this article. 




Figure 3. Geno clusters In tho first 2 principal component spaces. Principal 
component analysis allowed us to present the multidimensional data (in this case. 
4 -dimensional data of each gene expression pattern) in a simple 2-d intension u I 
graph. We de lived the 4 principal components, which arc a linear combination of tho 
standardized expression intensities (rero mean and unit variance) at 0. 24, 48, and 
72 hours. Tha first 2 principal components captured most of the variation of the data 
(approximately 85%). Therefore, tho data can b« displayed (with a minor toss of 
information) in a 2iwfuensional yrd|>h. The first and seconn* principal annpononte, ct mul 
r,2, art* given by Uie linr;ar combinations C\ - 0.747 • rrt -0.1t • n? - Q.U56 • n3 0 
n4 and c 2 - 0.2?8 • n1 ■♦■ 0.353 - n2 + 0.233 ■ /;3 - 0.803 ■ n4, whem n\,M,n'4, 
h\m) n4 ate the reseated a ltd standardized wxpresston lovels at 0, 24, 4B, and 72 
hours, resperAively. The axes ley ends cl and c2 stand for the first 2 principal 
components. In this papei we used the Pearson correlation In measure the similarity 
of cjich go no with the idealized expression patterns, as opposed to Uie Hudidean 
disUince wa used in a pievious work J 9 because clusters wtsrn better separated using 
litis measure. In both cases, we piesentedtha data in tfie 2-rtirnenslnnal spactiof tho 
lowest principal components. Tho data had a tendency to be secularly distributed 
when we used tho Pearson correlation as a distance measuie. 



Correlation between array and OD analyses 

Wo litive previously ilemoiijiirated it correlation coefficient o('0.9!5 
between visual estimates of changes in band iiuensity on DO and 
Phosphoriinagcr System (Molecular Dynamics, Sunnyvale. CA) 
cstimatBS of band intensity and a correlation coefficient of 0.88 
between hybridization intensity changes of mRNA on N on hem 
hlot analyses and changes in band intensity on DD. IV In a few cases 
there were clear discrepancies in the patient of expression of a 
Rene, as estimated by DD and by oligonucleotide chip analysis. We 
chose the 6 most extreme cases and examined the levels of niRNA 
change for Uie.se genes by Northern hlot analysis (Figure 4). In S 
cases, the Northern blot results agreed with the results of the DD 
analysis, whereas the results of Gnb2-rsl disagreed with the 
oligonucleotide array but duplicate bands from DO showed a 
relatively high level of expression in the 0 time sample that did uot 
correlate with the Northern hlot (Table 2). One possible explana- 
tion for these findings was the change in the relative use of different 
polyadenylation sites after the addition of ATRA to die MPRO cells. 

Constructing a database for mRNA level chanyes during 
myeloid differentiation 

Based on the data obtained above, an in-house database (dhMC) 
was constructed that included 2 suhdaiabases, dbMCd and dbMCa, 
for collecting gene information from OD or oligonucleotide arrays, 
respectively. Each entry in dbMC is accompanied by a so-called 
executive summary. 'I lie linkage between dbMCd and dbMCa was 
established by UniGene ID and cluster ID. dbMC contains the 
temporal expression patterns of ^enes during the MPRO cell 
differentiation process, including not only products represented in 
public databases but also novel transcripts. 
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Cebpa 




-28S 
-18S 



Figure 4. Northern blot analysis ot sb lectori mRNAs. Equivalent iintoijiils of RNA 
from MPRO nulls induced try ATRAat different time points (0 hour, 24 tiours, 4EJ hours, 
and 72 hours) were resolved by fonnaldehyde-finnrusH gel elactsophniusis, stained 
to verify the amount of loading. Eleven genes were separately prct>«U ok \Uh RNA 
filters. "Pie genu symbol of each piobe was listed at \Uu laft of □ related Northern blot 
result. Detailed Information on these 11 piohfts was listed in Table 5. Ona of thu 
RNA-blotted membrane photographs Is shown with methylene blue-stained 28S and 
1BS RNA subuntts demonstrating- the quality and quantity of RNA loaded in 
individual lanes. 



Analysis of gene expression patterns during MPRO 
differentiation 

Many of the genes identified in this study were fount! in myeloid 
cells or were implicated in myeloid development for the first time. 
We detected 8 cytokines 2 * and chemokines whose mRNA levels 
changed more than 5-fold by arrays and 2-fold by DD during the 
maturation of MPRO cells (see our Web site, http://biomfo.nibb. 



yale.edu/expression/ne.iitrophil). Among these were 2 members of 
the CC chemokine family. Imerloukin-la (\L-\a) was up-regulated 
at the late stage of differentiation (LLI..H pattern, Table I). 

mRNA for approximately 51 receptors was detected hy one or 
the other method. A number of the leccptors known to be present on 
mature neutrophils showed late induction of mRNA, and their 
levels of induction were high, indicating that the expression of 
these products is. a prominent event late in neutrophil maturation 
(Table 3). Rarely was mRNA for receptors down- regulated, 
consistent with myeloid maturation bemg accompanied by increas- 
ing responsiveness of the cell to a variety of externa! stimuli. 

Expression of mRNA for granule proteins 

Neutrophils contain several types of granules that develop at 
different stages of myeloid maturation.* 1 * 17 - 26 Levels of mRNAs 
encoding secondary granule proteins, such as lactoferrin, increased 
as the cells matured (Table 4). The level of mRNA for Mmp9, 
reported as a ternary granule protein, increased markedly between 
24 and 48 hours after the induction of differentiation, whereas 
inRNAs for secondary granule proteins either increased less 
markedly or showed a maximum increase by 24 hours. mRNAs for 
....sj^eraXpn™^ 
cathupsin G, were present in unstimulated cells and decreased as 
the cells matured. There was a discrepancy in the measurements of 
proteoglycan niRNAby DD and oligonucleotide chips, but North- 
ern blots showed that it reached a peak at 'IX hours and then 
declined (Figure 4). Cathepsin D is reported as a primary granule 
protein, but its pattern of mRNA expression more closely re- 
sembled that of secondary granule constituents. In addition to 
known granule components, mRNAs for several other cathepsins 
were up-regti)ated during myeloid differcmintion. in parallel with 
or later than the tertiary granule protein mRNAs. 

mRNAs for transcription factors 

Transcription factor genes, including several identified at the sites 
of consistent chromosome rearrangements in acute myeloid leuke- 
mia, have been implicated in normal myeloid differentiation and in 
the expression of neutrophil proteiiis. W7 I lowever comprehensive 
information concerning the expression of these transcription fac- 
tors during myeloid development is not readily available. There- 
fore, we compared gene names and identifiers in our databases to 
those of the transcription factor database Trans fnc (http:// 



Tablu 2. Expression pattarns of genus detected by Northern blot analysis 



Gene 
symbol 


Gene 
accession 




AD value by array 






In;, 


ensity by DD 




Oh 


24 h 


48 h 


72 h 


U h 


24 h 


48 h 


72 h 


Coops 


M62362 


33 


212 


IB2 


44 










Cebpb 


X62600 


390 


1248 


1380 


tWJ3 










Cebpd 


X618O0 


157 


282 


168 


430 










Cubpe 




















Myb 


M 12848 


892 


356 


230 


435 










Slpi 


U73Q04 


(517 


501 


703 


402 


l 


7. 


3 


3 


Pr?3 




153 


259 


339 


345 


5 


1 


1 


2 


Gnb?-rs1 


X75313 


4231 


3623 


3215 


3403 


4 


4 


1 


1 


Ly6u 


U04288 


3001 


5391 


2844 


1282 


3 


2 


1 


1 


Lspl 


M90316 


65 


376 


040 


28 


2 


3 


5 


6 


At:tb 


X037B5 


3095 


3586 


3976 


2434 


1 


. 2 


3 


2 



Gene symbol and genu accession refer to National Center fof Biotechnology Information da laicises a fit). In particular, to Locus Link. AD valuo Is the average ditteience in 
the value of hybrid Uation intensity butween the set of perfectly matched oligonucleotides and me sot of mismatched olignmirJuoijdo in thu oliyonuclt;otid« array. Band 
intensities liom 0D were swniquantlfiod on a seal*; from 1 {-+•) to 8 ( +■ + +■ ■+ + + Trmse ifstimatet; nrtj shown as bofdfacu numbers in this table." Bnth AD value and 
intensity of yones wore studied at 4 time points corresponding to MPRO cells induced for the indicated times. 

DD indicates differential display; MPRO, mouse pwnyebcyttc cdl line; (or gone symbols, sue the Appendix nt the end of this article. 
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Table 3. Receptors expressed during myeloid differentiation procvss 











At) value hy a nay 




Maximal fold i.hanyti 


Gene symbol 


Gene accession 


Oh 


?4h 


48 h 


72 h 


Less than 2 
















B?rp 


D21207 


64 1 


HSB 


881 


887 




Cmkar4 


X99S8 1 


508 


447 


378 


684 




Crry 


M34 1 73 


433 


384 


SOU 


506 




Csf2rb 1 


M 34 397 


318 


:ur> 


410 


241 




HtiSa 


7.182/8 


188 


272 


273 


339 




Mfipt 


X64088 


530 


409 


408 


649 






AA116789 


232 


04 


63 


381 




tcrgb 


M26053 


165 


212 


244 


299 




Tnfrsfla 


M 59377 


0 


1 


1 


1 


2 Oi moro, less Ihnn 3 
















C ink hi 1 


U28404 


??) 


244 


5(14 


638 




Cihi 


X72305 


t?1 


200 


250 


355 




Csf2rd 


M85078 


17 I 


372 


402 


254 




Ebi3 


AF013114 


187 


270 


428 


148 




Giidl 


010171 


128 


164 


15(1 


257 




Ifngr 


JOS 2 65 


14 1 


263 


327 


251 




H2rg 


U21 f95 


206 


184 


231 


477 




Ldtr 


X54414 


1399 


1653 


1665 


3968 






J02870 


849 


677 


381 


640 






X6Z701 


312 


443 


476 


734 


— 


Rarg 


M 344 76 


102 


113 


114 


218 




Srb1 


U37799 


126 


232 


132 


258 


3 of more, less than 4 


















M29281 


83 


138 


243 


77 




Csf2rb2 


M29655 


209 


249 


437 


1 1 1 




Fcertg 


J05Q20 


2398 


2765 


33G5 


8751 




Fcgrtb 


X04648 . 


1703 


1652 


1431 


4005 




Ifngr2 


U69599 


1 


2 


2 


3 


4 oi more, less than 5 
















Nrlal 


X1H99S 


96 


188 


202 


401 


5 or mot e 
















11 \i2 


X59/69 


4H2 


i m 


2872 


381H 




C5r1 


L05630 


185 


434 


BOH 


1078 




Did 2 


X55674 


0 


0 


0 


219 




Fojr3 


M14215 


1 


t 


1 


2 




FpM 


L22181 


0 


.89 


141 


671 




GCR 


AA24071 1 


2 


0 


0 


0 




L-t-CJR 


AA034646 


48 


175 


314 


2056 




NMDARGB 


AAG20211 


2 


2 


0 


0 




P2lXl 


X8489G 


79 


:ur> 


530 


744 




PtfO 1 


U96682 


0 


43 


172 


378 




PiraS 


U9BB86 


274 


391 


954 


1874 




Pinal 


U986R7 


122 


630 


2014 


17 IB 




Piib 


U96RH9 


191 


445 


D6t; 


747 




Sell 


M 25324 


46 


104 


570 


20 




Tag-V4 


M 54 996 


1650 


78 


65 


315 


Receptors are identified as present whoso maximal AD values 


were more than or equal to 200 U in tills study. G«i 


)f;s whih sorted by their expt 




nis us follows; 



first by the average difference value, then by the difference between minimum and maximum AD for the 4 time points, and last hy ih« alphabetical older of sunt: symbols. Geres 
were ordered according to the* maximal told change of AD values. Abbreviations o! gene names are taken from gene symbols listed in tne Locus Link poition cil ih« National 
Center for Biotechnology Information database where available, Numbers in bold denote those gene expression patterns obtained by differential display rather than by 
oligonucleotide a nay assays. The other inf or matron Is presented as in the legend to Table 2. 

AD indicates average difference: gene symbols aro expanded In an Appendix h*. the end of this article- 



www.transfac.gbf-bra unscliwcig.de/TRANSFAC) and detenu ined 
which factor:- contained in litis database weie present at detectable 
levels in MPRO cell mRNA, using Asymetrix software for die 
criteria for inclusion of mRNAs from approximately 200 murine 
transcription factors probe sets on the oligonucleotide chip. Of 
these, 54 were expressed and 13 showed changes of 3-fold or more 
in chip signal (Table 5). 

The changes in certain transcription factors, such as the moderate 
down-regulation of myb and myc and the tip-regulation of the Max 
dimerization protein MAD, were consistent with the shift of the cells 



from a proliferative to a differentiated state. 2 * Some changes iire more 
difficult to explain, such as the up-regulation of DPI, a partner for V7f 
factors in die regulation of S-phase genes, and the mild up-regulation of 
the Id genes, commonly associated with an inhibition of differentiation 
by competition with hi IU1 transcriptional activators.^ 

'ITie C/IiBl' family has been extensively studied with respect to 
myeloid differentiation. 2,30 Absolute levels of the G/EBl* o and o 
mRNAs were low, probably at the borderline oi" significance for the 
oligonucleotide chip assay, whereas the level of C/EBP fi appeared 
higher. In addition, there were discrepancies between the chip 
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Tutj1« 4. Granulo constituents expressed during muu.su jnomyftlncytic cell tirio c«H differ e ntiM ion 











AO vaii 


'it* by array 




Granule constituent 


Cpeno symbrjj 


Gent* accessiun 


On 


24 h 


48 h 


72 h 


Azurophil (primary) ytanutes 
















Man2c1 


AA161860 


1 78 




99 


164 




Ctsb 


M65270 


' 442 


480 


595 


389 




Ctst) 


X52886 


214 


irw 


1828 


2784 




Ctsg 




1509 


405 


46 


286 




E12 






1 273 


843 


157 




E1a2 


AA089016 


47 


159 


134 


163 






M83836 


544 


228 


2fiR 


254 




Lyzs 


M21Q50 


0 




1 


3 




Mcpi8 


X78545 


83 1 


208 


68 


491 




Mpo 


x 1537a 


37 BH 


3009 


776 


692 




Pro 


X Ifi 133 


262; 


26 5 3 


2920 


9859 


Possible (jrsnule proteins 
















Ctsc 




25? 


194 


34 2 


576 






X97399 


1 


J 


4 


5 . 




Qtsh 


l/0t>l 19 


45 


1 24 


195 


158 




Ctsl 


XOnQBo 


16 


1 1 


31 


237 








1 2 


9 


88 


463 


specific secondary granules 
















Cpa3 




6?1 




90 


801 




Cd3t5t2 


ABU0B553 


113 


93 


157 


187 




C:n!p 


XJM353 


60 


475) 


704 


(12» 




Gybb 


1 14 3384 


0 


24 


91 


128 




Ear2 




0 


1 


1 


7 




F,»1 


L221H1 


178 


220 


?:i5 


84 G 




Itgb2 


X 14951 


0 


2 


4 


2 




Lcn2 


Wl3lfV8 


91fi 


.1513 


393 1 


6036 




1.(1 


J03298 


19 


162 


333 


13H 




MRP 


W45H34 


5 


1 


1 


2 




Mmp13 


X6B473 


44 


43 


72 


178 




Ngp 


I 3729 7 


2661 


4782 


2311 


091 2 


Tertiary granules 
















Mmp9 


22 7 231 


0 


1 


2 


? 



Shown are »Jie possible granuJe protein cONAs represented on th*> ollgin nucleotide arrays, sorted by thoir expression patterns as follows: first by the average difference AD 
value, thon by the granule types, and kist by the alphabetical order of gene symbols. Data are presented as descrirxnl in tho legend to lijbte 3, 
AD indicates overage difference; gene symbols are uxpanded in an Appendix at thu end ot thb article. 



estimates and the mRNA levels observed by Northern blotting with 
specific probes for these geues. In particular, the latter method, 
more sensitive and specific, showed that C/EBP a began to decline in 
the most mature cells, whereas C/EBP 8 mRNA declined progressively 
beginning at 24 hours after tlie onset of differentiation. 

C/EBP m is a more recently cloned C/EBP family member. Previous 
studies indicated it is expressed in a large array of human leukemia cell 
lines blocked at various stages of differentiation and that it is up- 
regulated during granulocytic differentiation. 31 A C/EBP t probe was 
not included in tlie oligonucleotide chips, and litis mRNA was not 
detected by DD. Therefore, we examined the C/EBP t expression 
patterns by quantitative PCR and Northern blot analysis {Figure A). 
C/EBP t exon 1 was. PCR amplified from MPRO RNAs using primers 
RY48 (AGCCCCCCACACCClTOAlXiA) and RY49 (TGOCACACI- 
GCGGGGAGACAG) " The results showed that C/EBP f. is expressed 
throughout myeloid differentiation,' with expression levels increased 
moderately in the later stage* 

We detected a number of other, transcription factors that are 
broadly expressed or that have been reported in other studies of 
hematopoiesis (Table 5). Some of the factors that were most 
strongly induced during differentiation have been studied in other 
contexts but not previously implicated in hernatopoicsis, such as a 
mammalian homologue to the Droxophila enhancer of split gene, a 
transcriptional silencer. Tlie mammalian gene is expressed at 
relatively high levels as measured by the oligonucleotide chip and 



is a candidate for mediation of the silencing of growth-related 
genes in tlie maturing neutrophil. Another candidate transcriptional 
silencer, 'fifth, may serve as a compressor for die K.RAB domain 
family of zinc finger transcription factors and also may mediate 
binding of tlie heterochromatin protein HPl to DNA. J? 

There were 2fi transcription factors whose inRNAs showed no 
significant changes by oligonucleotide chip analysis and were not 
identified as differentially icgulated genes by differential display 
assays. PUT, r factor necessary for the production of neutrophils * 
and the expression of several neutrophil genes, 14 showed (ess than a 
3-fold increase in mRNA, below the threshold for a significant 
change. Other candidate hematopoietic transcription factors, such 
as PHBPlaB2 (AMU). GATA-1, and SP-2, were represented on 
the oligonucleotide chips, but their mRNA levels were so low that 
they were reported as absent in this study. The possibility that small 
changes in the levels or ratios of some transcription factors could 
produce marked changes in transcription potentially (units, die 
ability of data generated by present methods to explain transcrip- 
tional changes during different]' at inn. 

Protein expression patterns of MPRO cells during 
ATRA induction 

We visually compared the 2DE patterns from MPRO cells m the 
same time points used for in RN A analysis. In most cases the 
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TablH 5. Transcription modulators presented during myeloid itiffeientiation 



Ai) viilue by array 



Maximal fold di a rifle 



Gene symbol 



Oerw acnes sion 



Loss than 2-fold 



2 or moru, less than 3 



3 or more, less than 4 



4 ot inoiti, less ihan 5 



Zip 11 -6 

EltfJ 

Gata2 

Hmgl 

idbi 

Max 

Nfatr.2. 

Pm1 

Kajg 

R«la 

Sox 10 

Ybxl 

Zfp162 

Cebpd 

Idb2 

Jundt 

Ly(1 

Nfo2 

Nfkbl 



Pbxl 
sfpM 

nnb 

Trp53 
Usf2 
Ybx3 
Zfp216 

Irtl 

KH2 

Myb 

St3t3 

Ifdpl 

Otbpb 
SUal4 



AB02U542 

W 13G02 

ABOOOOnB 

J04179 

M318H5 

MG3903 

AA560O93 

U33626 

M3447H 

M6l90>) 

W53527 

M828G7 

V12836 

Xu 1 800 
M69293 
W29366 
X57687 
L09600 
L2B117 



AF020196 

A34693 

UB7303 

Pi 0361 

U 12283 

L35549 

AA510137 

M21065 

U25096 

M12348 

AA396029 

Q08639 

XB2000 
Y07836 



2630 
3 

562 
337 
455 
256 
2313 
173. 
102 
297 
419 
643 
671 

1V 
244 
1274 
399 
458 
953 



611 

375 
673 
259 
129 
96 
8? 

85 
62 
892 
484 
307 

223 



771) 
343 
7B7 
224 
3218 
201 
113 
260 
461 

734 

?02 
?10 

2002 
342 
743 

2044 



303 
764 
659 
149 
185 
169 
151 

207 
66 
356 
1057 
500 

1248 
383 



2795 
2 

472 

177 
721 
312 
2396 
329 
114 
304 
434 
472 
720 

158 
310 
1434 
34 7 
1042 
1876 



340 
991 
420 
125 
286 
210 
204 

2/8 
246 
230 
1012 
SO!) 

1 380 
510 



730 
232 
637 
172 
254? 
306 
218 
244 
837 
496 
902 

■ 430 
604 

308G 
891 
505 

2034 

529 
863 
361 
192 
110 
106 

198 

n 

435 
290 
1093 

1903 
93b 



Obpa 

Ug 

MaO 

Myc 

Etohi6 

TBXt 



M62362 
X73359 
X83106 
L00039 
WB9667 
AA 542220 



33 
99 
0 
3M 
169 
0 



212 
565 
111 
112 
386 
0 



18? 
. 916 
167 
62 
313 



44 
1005 
327 
173 
1003 
? 



Shown aro the transcription (adon; identified us prusem by the oligonudootirlo array analysis whose maximal AT) btttwettn pnrlBct match and mismatch oligont/dentitli: 
s«ls was gifialt.-i than or i ; qual lo 200 U in this study. Data aro presented as dusciibed in the legend to luble 3. 
AD indicates average oiffuriince; gane symbols aro expanded in an Appendix at the end at thin article. 



peptides identified for a given protein were derived from regions 
along the entire length of the protein, indicating the observed 
products were not the result of proteolytic degradation. These 
data must be considered with several caveats: membrane and 
other hydrophobic proteins and very basin proteins are not well 
displayed by the standard 2DE approach, and proteins present at 
low levels will be missed.* 5 In addition, to simplify MS analysis, 
we used a Coomassie dye stain rather than silver to visualize 
proteins, and this decreased the sensitivity of detection of minor 
proteins. The MS method we used was sufficiently sensitive to 
identify proteins that could barely bo visualized by colloidal 
blue staining. However, a limitation of the method for the mouse 
is that the current database lacks predicted amino acid sequences 
for a substantial fraction of murine genes. In addition, very 
small proteins give only a few peptides, making statistically 
confident identification difficult. 



Figure 5 shows the analytical colloidal blue- stained 2DE IPG 
reference maps of differentiated MPRO cells. Expression patten is 
of more than 500 protein spots were detecTfid and observed through 
the entire series of gels. Protein spots could easily be cross- 
matched to each other, indicating the reproducibility of the method. 
As marked on the gel pictures (Figure 5), 50 proteins with a wide 
range of molecular weights (1 to 200 kd). isoelectric points (4 to K )\ 
and abundances were subjected to MS protein identification. The 
results are presented in Tablo 6. 

Comparing the theoretical value of die molecular weight and pi 
of each protein to that of the observed value, wc confidently 
identified 28 proteins in the expected position on the gels (spots 1 to 
28). Some of the other proteins with strong matches to the murine 
databases migrated to a somewhat unexpected pi position. Nine 
spots gave clear peptide peaks on mass spectroscopy but did not 
match any known gene. Theii identification will requiie amino acid 
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Figure 5. 2t>E eleetrophoretngrams ol MPRO c»t|s. IPG 8 

MPRO cell lysate (2.5 x 10 b celi/sample) wns loaded tor . ^ pj._| 

2 DE analysis. Cielt; wcic stained with brilliant blue G-col- j 

loiUaldyo.tAJZDEmapofunlndi/codMPROcoll^houi). Q Rrf) 45 W &6 60 5 " 5 ' l ' T * S RXi H * 45 50 55 Ci ° ft ' 5 : '° " W 35 

(B) 2DE map of mmumd MPRO culls (72 hours). Protein g| 1 2C0 . V^. ' ^ 7 V'- , '"";" ; ~^ - ^v^'"V*^r^"~ 

spots marked in the maps wore considered d iff wentialfy * ? ifjo v^' " ; * ■ ' 1f .« " " J *"" ::j?: ' ^-•■'-'^ 



expressed and wore subjected to MS analysis. The q f$ ' '^^'^^"^.^^^^^.ii^ 4^ ^ V#;i ; ; X-&££^r-;>?ii 

resultant protein information is listed in Table G. (/) I \K'J* ! '. ffoi '■■ ^ '■ S'^^ik- 



sequence analysis or availability of more, extensive niuruiP data- that were differentially expressed significantly (5-fold or greater 

bases. We searched for the expression patterns of die genes cognate cliaiiye by array or 2 -fold or greater change by DD). 

to the expressed proteins in dbMC (Table 6). Nineteen genus were 

found in dbMC, the mRN A for 5 geues was reported as absent, and 

13 genes were pre sent during MPRO diff erentiation. Comparison DlSCUSSion 

of the expression patterns showed only A genes of 18 present on the — — 

oligonucleotide chips whose expression was consistent at die RNA We explored the temporal patterns of gene expression during 

level and protein level. None of these was on the list of the genes myeloid development. A database has been developed to provide a 
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(he proteins listed here are repr^s tinted by the spots marked in the eleciroplior titograms shown in Figure 5. 

Protein definition, Gi number, and predicted value mfei to the protein name, accession number, and properties derived from the Nntionnl Center for Biotechnology 
Information protein database. I'ho oolumn labeled % shows the puroentaye ol peptides predicted from the protein sequence that were detected by mass spectroscopy. The 
expression luvet of protein spots exprussud in intxise promyelocyte cell line coll induced by all-rTtw;; ruiinnic acid for 0 tours a (id /2 hours (Figure 5) weru .senrrtd on n si^jlo of 
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Y indicates agreement and N discrepancy between changes IncONAand protHln spot Intensity. The numbers in bold wjre obtained with DD. ZUE fndicattts 2-dimonsional 
got el oarxj phoresis; IgC, immunoglobulin H; 00. differontial display. 
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reference for later research on the molecular mechanisms underly- 
ing normal myeloid development. 

The MI'RO eel! system morphologically mimics normal myeloid 
differentiation and bioelvemically proceeds mrtlier toward mature neutro- 
phils Uian most oilier in vitro systems, Because (he arrest in differentia- 
lion of MPRO ceLls growing in the absence of ATRA is not physiologic, 
Uiere is a theoretical risk that gene expression in these cells is noi 
coordinated in die way thai it is in normal rufierentiation. Ii is 
encouraging that, for the most part, the timing of expression of genes for 
proteins of the various neutrophil granules is consistent with the timing 
of the morphologic and biochemical appearance of these granule 
components during normal myeloid different] at ion. 

The DD .technique provides certain advantages for detecting 
and comparing mRNA levels ui different samples. First, the method 
is, in principle, similar to competitive RT-PCR, and, with the use of 
stringent PCR conditions, is expected to be about as reliable. 
Second, display patterns are reproducible. Third, the method 
detects the levels not only of RNAs already represented io the 
database but also of unknown RNA species that may represent 
"new" genes. Fourth, closely related genes can be distinguished 
regardless of cross-hybridization, provided there are some single 
nucleotide differences' in the 3' end sequence. Limitations associ- 
liteTT wIuTtFiHe^uiique are that numerous gels are iieeessary~to get" 
complete information and that comparison of the levels of different 
niRNAs is only approximate because of the differential amplifica- 
tion of bands of different si'^e or sequence. 

Oligonucleotide chip analysis is a fast and effective means of 
accessing mRNA expression patterns. 7 '* Cluster analysis of groups 
of samples by this approach is effective. However, the present 
results indicate that alternative methods of verification are desir- 
nble before, die data on an unexpected change in a particular gene 
are definitively accepted. 

To obtain the broadest range of information from the myeloid 
differentiation process, both differential display and oligonucleotide 
chip techniques were applied in the current study. As a result, 65 J % of 
the observed elianges in mRNA levels came from the differential display 
method and 1 1 .5% came from oligonucleotide chip assays. 

Our data showed in general that changes in expression pattern 
by the 2 methods agreed qualitatively but that there, was some 
quantitative variation. Our results indicate, that DD may be a more 
accurate way to detect changes in levels of gene expression than the 
oligonucleotide chip assay. However, improvements in the types of 
oligonucleotides used in arrays may close this gap in the future. 

The mRNAs for a limited number of trauscripUou factors vary in a 
pattern correlating with that of the mRNAs for primary or secondary 
granule proteins. However, more detailed uifonnation is needed, and the 
underlying mechanisms of granule gene regulation remain unclear. The 
number of potential positive and negative rcgulatoiy factors found hem 
is sufficiently small as to make it feasible to perform in vivo studies, 
such as cliromatin iinnnuioprccipitatiofii. 

The oligonucleotide chip used in this study focused on known 
genes, whereas the DD method samples all polyadenylated tran- 
scripts. The latter method generated a large number of products not 
associated with known genes, in pan because the mouse genome is 
not as well represented in the database as the human genome. 
However, our experience with DD and human mRNAs indicates 
that substantial fractions of the products represented as ESTs ot not 
represented at all in the public databases are cDNA copies from 
introns, hnKNA, or other RNA with internal A runs. 

Approximately 5° sequences obtained from gel -display bands 
had significant changes in the level of expression and a sequence 
that did not match that for any named gene in the public databases. 



Of these, 38 had plausible or excellent poly A signals. This is only 
an approximate estimate of the number of new genes found* 1 ' 
because a ft act ion of the mRNAs for known genes still had poor 
poly A signals. In addition, the full 3' untranslated jug ion is often 
not known for characterized genes, and in some cases these new 
genes may prove to be identical to products identified by the 
oligonucleotide chips when more complete sequences are obtained. 
At the least, their presence indicates thai a substantial fraction of 
the regulatory or functional circuitry of maturing myeloid cells 
remains unexplored and that valuable tools for their investigation 
will emerge from a combination of RNA expression studies and 
analysis of emerging genomic sequences 

The desired end point for the description of gene expression in a 
biologic system is not only the analysis of mRNA transcript levels 
but also the accurate measurement of protein abundance. The 
developments in 2DE and new MS instrumentation make it 
possible to accomplish this work rapidly and efficiently. In this 
study, we attempted to identify a number of the proteins differen- 
tially expressed between uninduccd and ATRA -differentiated MPRO 
cells and to examine the relation between mRNA and protein expression 
levels for these genes representing the same state. 

For protein levels based on estimat ed int ensity of Cooinassic dye 
staining in 2DE, there was [xxir correlation between clumges in mRNA 
leveLs and estimated protein levels. Other groups have studied the 
correlation between mRNA and piotein levels in yeast and liver 
cells. 1 '' ,2>M In the liver cell experiments,"'' 7 correlation coefficients of 
0.4 to less than 0.5 were observed. In an extensive study in yeast,"' 1 * the 
correlation coefficient was high if the most abundant mRNAs and 
proteins were considered. If a handful of these products was omitted, the 
remaining correlation coefficient was OA or less However, one 
could TestoTe some of the correlation by averaging individual 
data points into broad proteomic categories.-' 7 

The discrepancies between mRNA and protein levels in MPRO cells 
appear to be substantially larger than those observed for yeast. Possible 
causes for the discrepancies include iranslatioua! regulation, duTerential 
expression of certain mRNAs at various stages of cell growth in vitro, 
post -trans la tioiial protein modification dial varies with the stage of 
maturation of the cells, :md selective degradation or excretion of .proteins 
in vivo Furthermore, here we axe focusing on a developmental 
rime-course, whereas the yeast study concentrated on the organism m 
vegetative growth. New techniques, equipment, and bioinfnmintic 
analysis tools must be developed to make such systematic, global, and 
quantitative analyses feasible. 

The initial studies of protein expression presented liere provide a 
cautionary note for efforts to interpret cell composition and function in 
rotation to mRNA levels. Discrepancies we observed between gene 
expression and protein abundance suggest that selective post-trariscrip- 
tionaJ controls may be at least as important as changes in mRNA levels 
in determining the protein composition of neutrophils and that they arc 
phenomena less well explored man transcriptional control. Aiuttysis of 
mRNA expression patterns is itself only a small beginning toward u 
genome- wide description of cellular components. 
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Appendix 



Gene symbols used in tables: A.db: actio, beta, cytoplasmic; Actg; actiii, frurima, 
cytoptanTiic; AcU: inelanoint X-aclui; Aldol: aldolase I, A isofunii; ArfS: 
ADP-fibosylfltion fnctor 5; Atfl : activating transorijMion factor J ; Atft: activating 
tnuiKajpiion tact or 2; BtD: basic IranKcription factor 3 a; Bzrp: peiipheral-typc 
beitzodia^jepine receptor; C1rJ: couipleiiiait component 5, recqitor l/G piolem- 
coupled receptor (C5a); Ccnb2: cyclin M2: CcCt6!2: CDS6 tmtip,wi (col b pen type 
I receptor, UiromDospandin recepior)-liKe 2; Q153. CD53 mdigen; C^;l>pa; 
CCAAT/eiihancer binding protein C/EHP, alpha; Ccbpb: CCAAT/enhancer 
buuling protein (C/EBP), beta; Cebpd: CCAAT/ejuSancer binding protein (C/ 
L;DPJ, ddtt; Ci-l'pe: CtAAT/uilnrtiiLY buidiivi protein (OTBP), cpsiloa; Cflt 
cofiUn | ( nonmuscle; Ciiikai'4: cheinukuut (C-X-C'J.rectrp(or 4; Cinkbrl : cheino- 
kint (C-C) receptor 1/MiplH receptni: Dilp: cathclm-like protein; Cntf: ciliary 
neurotropic factor/2 in c iinKer protein P/J 7 ; Copa: cuatomer protein complex 
stibunit alptm; CpfC : civr boxy peptidase A3, mast cell; Cr2: complement rvceptoi 
2; O'fu™ corticotropin releasing hormone iece|)tor; QTy: complement tveeptm-- 
related protein; Csi'lr; CSV I (M-CSF) rei:(iptar/c-frns/CDllS; Csf2ra: CSI*' 2 
(GM-CSI : ) receptoi; alptw, low -affinity/CD ( 1 6; Cf,f2rbl: CSF 2 (GM-CSF) 
reeeplur, beti 2, low-ailiiuty/IL H recejMor-likK fuotctn (AlC2UVCDwi;i I ; 



Oit2r1)2: CSF 2 (GM-CSF) leccpfor, b«ta 2, Inw-aiTuiity/ILO receptor (A1C2A); 
Ctsb: cathepsin C; Ctsc: catliepsin C; Ctsd: catliepsin O; Ctsc: cnthepsin E; Qsg; 
cathepsin G; Dih: cailvftpsin \l, Ctsl: catliepsin L; C(ss: calhepsin S; Cybb: 
cytodinnne b-245, beta; Drd2: dopamine receptor 2; E2tV f;2F transcription 
factor 1; Hht?.: eoKuiophil-Hssnciatnd rilionuclease 2; K;bD: Epstein-Barr vuits- 
iiiduccd f'^ne Vcytckuv: receptnr-like molecule (KU!3); B2; BalbVc ucutiophit 
elasta.se; KIa2: ettstase 2; lid) , enhance r of MiduncniAry homo log (Orosophib); 
Htotu6: etk-uiol induced d y sterol regutatoiy element binding transcription factor I 
(SREBFI) hornolofi; F2il2; coa^uLititm factoi' II (Huoinbin) receptor-like 2: 
Fcv:: 1 Fc rert:ptor, IgE, hit-Jt ntYinit) l.mmmn pnlypppiide; FcgfJIv Fc receptor. 
I(jG, low aftlndy lib; Fr.grlt: Fc receptor, IgG, low atTtnjty III; Fpil: funny I 
pept itic receptor I/ihil.P rt-ceplor; G.ibpbl : GA repuat binding protein (GABP- 
betal suhtuiit); Gat»2: GATA-biiidinR j)iott:in 2; Gnas: n^anine nnclttotide 
binding protein, alpta sLimiiliiting; Gnb2nsl: guiuiijie niwleotide buuliuf» protein, 
beta-2, retitcd sequence I; Gp.O: glntatfiione pi:r\)\idase 3; Gig: related to 
Drosoplub grouefio gwic; Grid I; glulim;de n:ct^)toi chiuinuJ submiil delta I; 
Gin: granulin; Gstinl: RlutHtiHnnB-S-traiislciitsc, inn 1; Gus-s: beta -glue i iron i- 
dftse stntclural; Gys.1; pjycoj'.en synthase 3, brain; H2-D: hisloconifiatibtlity 2, f) 
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legion locus I; Hisil: histone gene coinpip.* 2: Hist5-2ax: H2A tustone family, 
member X; Hingi; high mobility group protein I; Hsp60: heat shock protein, 60 
kDa: Htr5a: 5-hynYn.\ytryptajnine (serotonin) receptor 5A; Idbt: inhibitor of 
DNA binding I /he I is- loop-Mi* UNA binding protein regulator (Id); Idb2: 
inhibitor of UNA binding 2; ling)" interferon gamma receptor; lthp/2\ interferon 
gamma receptor 2; li: la-associated Li variant chain; HI a: ILl alpha; lilr'2: ILI 
receptor, type II; II2rg: IL2 receptor, gamma chain; IMra: 1L4 receptor, alpha; 
I! I Orb: ILK) receptor, beta; Ill7r H. 17 receptor; Irfl : interferon regulatory factor 
I; Irf2: interferon regulatory facta -2, Itgb2: integrin belt 2 (Cflltt); ItnrS: inositol 
1,4,5-trisphospliate receptor (type 2); Jundl: Jim pro! o-oncogene -related gene 
dl/transcription factor JUN-D: KIR: Kruppellike ihctoj LKLF; L-CCR; lipopoly- 
saccharide inducible C-C chemokine receptor-related; l.cn2: lipoailin 2; Ldtr: 
low density lipoprotein receptor; Lspi: Lymphocyte-specific !/S.17/pp52; I.srfi: 
leucocyte- specific transcript 1; Ltb4r: leukotriene B4 receptor; Ltbr iyniphotoxiii- 
beta receptor; Ltf: lacioUansfeirin; l .yM: lymphocyte antigen 64; Lyfce; lympho- 
• cyte antigen 6 complex locus E; Lyll ; lymphoblastoma leiikcinia/biiLI I factor; 
l.yzs: lysozyme; M6pr. niamiose -6- phosphate receptor, cation dependent; Mad: 
Max dimerization protein; Man2cJ: mannosidase, alpha, class 2fJ, member I; 
Max: Max protein; Muz: MYC-associated zinc finger protein (purine-binding 
transcription factor); MBP: eosinophil granule major basic protein precursor: 
McptS: mast cell protease 8; Mil: myeloid/lymphoid or mixed-lineage leukemia; 
Minpl3: matrix metalloproteinase 13/coIhgeuase; Mrnp9; matrix mctuUoprotein- 
ase 9/gelatinasc B; Mpo: myeloperoxidase; Myb: myeloblastosis oncogene; 
Mybl2: myeloblastosis micogene-Iike 2; Myc; myelocytomatosis oncogene; 
— ^4yUlH3^sia4ig!^uaiivaiJ'aLi r ^^ 

cells, cytoplasmic 2; Nfe2: nuclear factor, erytliroid-derived 2, 45 kl)a; Nikbl: 
NF-kappa-B (pi 05): Ngp: neutrophilic granule protein; NMDRGB: N-methyl-D* 
aspartate realtor glutamate-binding chain homolog; Npml: nuclcophosmin 1; 
Nr4al: nuclear receptor subfamily 4, group A, member 1; Osi: oxidative stress 
induced; P2rxt: purinergic receptor P2X, ligand-gated ion channel, 1; P2ry2: 
purinergic receptor P2Y, G -protein -coupled 2; P40-K; P40-B, functional/bminin 
receptor-: Pbxl: pre B-celf leukemia transcription factor I; Pit; properdin factor, 
complement; Ptrat: rviired-Ig-like receptor Al; Ptra5: paired- Ig-like receptor 



A5; Pirafi: paired- Tp-like. receptor A6; I'irlv paired-Ig-like receptor B; Plaur: 
urokinase plasminogen activator receptor; PMI: putative receptor protein (SP: 
PJ7152 ); Pip.!: prnmydocylic leukemia; Pj^: proteoglycan, secretory granule; 
Prg3: proteoglycan 3/eosinophiJ major basic protein 2; PrtnJ: proteinase 3; 
Psirm2: proteasnmc (pro some, macropain) subunit, alpJw type 2, Ptmlvl; prolliy- 
inosin beta 4; Ptjuc: protein tyrosine phosphatase, receptor type, C; Rac2: 
RAS-related CS botnlinum substrate 2; Raig: retinoic acid receptor, gamma; 
Rela: avian reticuloendolhcliosb; viral (v-rel) oncog(3ie homolog A/NF-kappa-D 
p65; RpHV: rihosomai protein LI 9; RPLR; ribnsomal protein LtJ; Rps6kal; 
ribosoinal protein S6 kinase pnty peptide I; RpsS: ilbosornal [note in ^K; RmH: 
retieulnn 3; Sl00a8: SIO0 calcium binding protein AS (cnlgrmmlm A); SKJOaV: 
SI00 calcium-binding protein A9 (catt'/nnului B); S<lfr2: stromal celt-derivwl 
tador recqnor 2; Sell: selectin I. (lyinplwcyte adliesion molecule IJ; Sema4d: 
semaplwrm 4D; SeppI: selenoprotein P„ plasma, I; SfpiL SFFV provind 
integnitirin I; Siifdgl: split luud/foot deleted gene 1; Slclflnl. solute carriei 
family 10 (sodiurrutiile acid cotnuisptuter tanuly), member 1; .Nlpi: secretory 
leukocyte pioteaye inliibitor; Sox 15: 5»RY-box containing gwie 15; Spi2- 1 : serine 
[jrotea«.* inhibitor 2-1 ; Sib I: scavenger receptor class B t; StnO: signal transducer 
And adivator of (inscription 3; Slioa: signal 0'ansducer and activator of 
transcription 5A; Stat6: signal transducer and activator of u ;ui script ion 6; Stral4: 
basic-helix-loop-hclix protein-ret inoic acirl induced; Tbxl : TBXI protein/LPS* 
induced TNF-alpha facttar homolog; Ta*gb: 'I'-cdl -receptor gemilint! beta^tyiin 
gejie cojxstant rttgion; Tcrg-V4; T-cell-receptor gamma, variable 4; Tctexl: 
t-complox testis expressed I; Tfdpl: transcription fhctor Dp I; Tiflb: transenp- 
-tioflal JnlWTiwuary^ctriLl^bcla ; Jflr4 Jdl:JjJa^re^eiaor^;.;riy^i^^ 



receptor si^erfamily, member la; Tnli-stlb: INF superfamily, member lb; 
'I"omm70ii: Uanslocasc of outer mitochondrial meinbrane 70 (yeast) homolog A; 
Tpi: trio srphosph ate isomer a se; Trp53: Uajisformatioiwelated protein 53; Ubb: 
ubiuuitin B; Usf2: u()Stieam transcription factor 2; Ybxl: Y box transcription 
factor; Ybx^V' Y box binding protein; 7Sp 1 1 -6. -m\c. fiiigei' protein si t -o; Zlpl 8: 
zuic linger pi otein 18 homolog; Zip 36: xi M c finger protein S(t; Ztj)l62: zinc ringer 
protem lf>2; Zlp2l6: zinc finger pi otein 216; ZfpiTil: zinc finger protein, 
inultitype I ; Znfi\lat : zinc finger protein, subfamily I A, 1 (lk:u-as); Zyx: zyxin. 
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Bacterial lipopolysaccharide (LPS) evokes several 
functional responses in the neutrophil that contribute 
to innate immunity. Although certain responses, such as 
adhesion and synthesis of tumor necrosis factor-n:, art; 
inhibited by pre treatment with an inhibitor of p38 mi- 
togeu-autivated protein kinase, others, such as act in as- 
sembly, are unaffected. The aim of the present study was 
to investigate the changes in neutrophil gene transcrip- 
tion and protein expression following lipopolysaccha- 
ride exposure and to establish their dependence on p38 
signaling. Mieroarray analysis indicated expression of 
13% of the 7070 Affymetrix gene set in nonstimuiated 
neutrophils, and LPS up- regulation of 100 distinct 
genes, including cytokines and cheniokines, signaling 
molecules, and regulators of transcription, Proteomic 
analysis yielded a separate list of up -regulated modula- 
tors of inflammation, signaling molecules, and cytoskel- 
etal proteins. Poor concordance between mRNA tran- 
script and protein expression changes was noted. 
Pretreatment with the p38 inhibitor SB 203580 attenu- 
ated 23% of LPS-regulated genes and 18% of LPS-regu- 
lated proteins by >:40%. This study indicates that p3B 
plays a selective role in regulation of neutrophil tran- 
scripts and proteins following lipopolysaccharide expo- 
sure, clarifies that several of the effects of lipopolysac- 
charide are post-transcriptionnl and post-translation al t 
and identifies several proteins not previously reported 
to be involved in the innate immune response, 



Lipopolysaccharide (LPS), 1 a component of the outer cell wall 
of Grain -negative bacteria, evokes a variety of functional re- 
sponses in the human neutrophil (PMN) after binding to a 
plasma membrane receptor complex that involves the Toll -like 
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receptors (TLRs) (1-5). These ''immediate" functional re- 
sponses, including actin assembly, adhesion, activation of nu- 
clear factor- kappa B (NF-kB), and priming for an enhanced 
secretory res po nee and for release of reactive oxygen interme- 
diates, appear to t>e central both to the innate immune re- 
sponse and to the pathogenesis of several inflammatory human 
diseases, including sepsis and the acute respiratory distress 
syndrome (6). p38 mitogen -activated protein kinase (p35 
MAPK) has been shown to mediate LPS- induced PMN adhe- 
sion, NF-icB activation, and TNF-cr and iL-8 translation and 
release (7), and its blockade attenuates LPS-induced PMN 
accumulation in the airspace (8), However, other cascades al- 
most certainly lead to downstream effectors of the LPS signal; 
for example, actin assembly appears to be p3S MAPK-inde- 
pendent(9). An improved understanding- of the transcriptional 
and translation al responses of the neutrophil to LPS and the 
modulation of these responses by p33 MAPK might carry 
pathogenetic and therapeutic implications. 

Historically, it has been believed that the downstream PMN 
transcriptional response to LPS is static and that PMN func- 
tional responses to LPS that depend on de novo protein syn- 
thesis an; primarily limited to the release of cytokines (10J. 
However, recent studies indicate a robust transcriptional re- 
sponse (11). To date, most studies have relied upon and re- 
ported a short list of functional assays of the LPS-exposed 
PMN; therefore, no exhaustive investigation of either the tran- 
scriptional response or protein synthetic repertoire of the PMN 
has been reported. Although several techniques have been used 
to evaluate transcripts, the screening of global changes in 
mRNA by mieroarray analysis has only recently become possi- 
ble. In this way, thousands of genes can be screened in an 
unbiased fashion ibr transcript abundance. Such genomic 
screens in mammalian colls have previously been applied to 
define altered expression profiles in response to agonists (12) 
and to drug action (13) and during cell cycle progression (14). 

Although DNA mieroarray technology is expected to provide 
insight into the response of the human PMN to LPS (15), 
inhibition of LPS stimulated l.L-'l. and TNF-or production by 
p38 MAPK inhibitors in THP-1 cells (16) and of TNP-cv synthe- 
sis in human PMNs (9) occurs at a translational level and 
would therefore not be detected by DNA micron Trays. Further- 
more, in other systems, such as yeast and human liver, mRNA 
and protein levels show poor correlation (1,7, 18). Proteomics is 
a complementary tool for assessing global change** in cellular 
protein expression, thereby providing additional insight into 
cellular signal regulation. A proteomic approach has proven 
useful in different systems for dissecting signal transduction 
cascades and describing their output (19, 20) and has even 
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recently been usod to detect novel upstream messengers in- 
volved in LPS signal transduction (21). We have applied DNA 
microarrays and proteornics to define and compare transcrip- 
tional and pnst-transcriptional alterations in tit e LPS-ex posed 
PMN and to establish the dependence of these alterations on 
p38 MAPK signaling. 

EXPERIMENTAL PROCEDURES 

Materials — Endotoxin -free reagents and plastics were used in alt 
ex pari men ts. A pro Limn, I mi p p. p tin, AEBSF, E-04, peps ta tin, and I ich La- 
tin protease inhibit ore, spermine HOI, and n:-cyano-4-hydroxyoirinamic 
acid (CHCA) were all purchased from Sigma Chemical Co. (fit. Louis, 
MO). SB203680, a p38 MAPK inhibitor, was purchased 'froin Calbio- 
ehem-Novabioehem Corp, (San Diego, OA). Kor two-dimensional PAGE, 
rehydration butler, cqui libra ti on buffers, vertical electrophoresis solu- 
tions, and 10% homogeneous polyacrylamide slab gels wen; purchased 
from Genomic Solutions, Inc. (GSI, Ann Arbor, MI). Sequencing grade 
porcine trypsin was purchased from FYomega (Madison, WI). 

LPS Incubation. — PMNs were isolated by the plasma PercolJ method 
(22), n technique that yields less than 5% monocytic contain iiiation, and 
resuspended at a concentration of If). 4 X 10°/ml in RPM1 1640 culture 
medium (BioWhittaker, Walkeravitle, MD) supplemented with 10 mM 
HEPES (pH 7.6) and 1% heat-inactivated platclet-poor plasma. After 
addition of 100 ng/ml Escherichia coli 0lll:B4 LPS (Lust Biological), 
incubation was carried out with continuous rotaLion (4 h, 37 °C) both in 
the presence and absence of SB203580. Both Afrymetrix analysis and 
protoomic analysis utilized 75 X 10° cells. For micron rruy analysis, 
nonstimulated and 4-h-treatcd PMNs were collected from three sepa- 
rate donors. A more detailed lime course following' LPS exposure was 
performed using polymerase chain reaction. Kor proteoinic analysis, 
LPS incubations from separate donors {n ~ it) were performed and then 
analyzed individually. Control ami post- LPS incubation PMNs were 
washed (0,34 m sucrose-/ 1 mM EDTA/10 mM Tris) and then lysed in a 
modified rehydration buffer (GSI, Ann Arbor, Ml ) supplemented with 2 
m thiourea, 50 mM dithiothreitol (D'lT), 22.5 mM spermine HC1, and a 
mixture of six protea.se inhibitors (10 /ig/ml aprotinin, 10 fig/ml leupop- 
tin, 2 mM AEBSF, G p.M E-64, 1 >im pepstatin, 10 bes latin). DNA waa 
pelleted by ceiitrifugatiou at 250,000 x g for 60 min (23). 

Asymetrix Oligonucleotide. Array — Five micrograms of total RNA was 
isolated with TRIzol (Invitrogen) ami RNTeasy columns (Qiagen) ami 
subsequently labeled with biotin as described by Asymetrix. Briefly, 
first-strand synthesis was accomplished with Superscript II reverse tran- 
scriptase (Invitrogen) using a Tf-oligofdT^ primer for 1 h at 42 U C 
followed by secoi id-strand synthesis using E. coli DNA pulym erase I and 
KNase H (Invitrogen) at 16 °C lor 2 h. Double-stranded DNA was used as 
a template for in vitro transcription with T7 RNA polymerase in the 
presence of biotin -labeled UTP and CTP using the BiuArrny High Yield 
RNA transcript labeling kit, (Enzo). Fifteen micrograms of cftNA was 
fragmented and used for hybridization ta Aflymetrix HuGene (I800FL 
(jeuechips. Each sample was hybridized initially using a Test2 Genechip 
to test for sample degradation and full-length in vitro translation. Data 
were analyzed using Alfymotrix Genechip software. Results from three 
separate donors were analyzed. 

Reverse. Transcription, and. Polymerase Chain Reaction— r.ON A was 
prepared by reverse transcription using 2 /xg total RNA T derived from 
20 X 10 fi cells that were treated as indicated. Polymerase chain reac- 
tions were performed using' sped fie primers for Mx-l, TNF-x, MC! } - /, 
p65, S10OA4, and gly eer aid ehyde-3- phosphate dehydrogenase. 

'Vivo- dimensional PAGE— The protein concentration of the ly sates 
was measured as described by Bradford et al. (24). Poor isoelectric 
focusing (IEF) results were encountered unless the polycationie sperm- 
ine was diluted (data not shown); therefore, lysatca were diluted with 
rehydration buffer (GSI, Ann Arbor, Ml) to achieve a final spermine 
concentration of 6 mM. Equal protein loads (1.5 mg) of control and 
LPS-stinmlated neutrophils were used to rehydrate IEF gels overnight 
(18 cm, pH 3-10 nonlinear fmmobiline DryStrip IEF gels, Amersham 
Biosciences; Pi sea ta way, NJ). [EF was performed at 20 a C to 100-kVh 
(Phaser, GST) under mineral oil, followed by two 10-miu SDS equilibra- 
tion steps (DTT and then iodoacc tami de-con tnining equilibration bun- 
era, GSI) and then by vertical electrophoresis on 10% homogeneous 
polyacrylamide slab gels (GSI) at 500 V. Protein spots were visualized 
by agitation in colloidal Coomassie Brilliant. Blue G-250 (10 h) (25), 
followed by destaining in deioniznd water (20 h). In separate experi- 
ments, control and LPS stimulated PMN lysates from three donors 
were pooled and then analyzed by two-dimensional PAGE using over- 
lapping narrow isoelectric point (pi) ranges (18 cm, pH 6.0-G.0, 5,5-- 



fi.7, and 6-11, Amersham Biosciences, Pis ca tawny, N.J). Identical IEF 
and vertical electrophoresis parameters were used for all gels. 

Image Analysis of Two-dimensional Gel s— Colloidal Coornas.sie- 
st! lined gels were digitized using a Powerlook II (UMAX Data Systems, 
Inc., Taiwan) flatbed scanner with 8-bit dynamic range and 150-dpi 
resolution. Biol mage (GSf, Aim Arbor, MI} 2D- Analyzer software was 
used to locate, qua mi la to, and match protein spots on the control and 
LPS gel images. Analysis was performed by assigning 50 common 
anchor spots between paired images; the remaining spots were com- 
pared by a constellation-matching algorithm. All data were then care- 
fully reviewed by the operator to account for any discrepancies. Protein 
loading between control and experimental gels may have varied be- 
cause of inconsistencies in rehydration of the different IFF gel strips, 
therefore, gel images were normalized so that the sum of the integrated 
intensities of all matched spots on paired gels was made equal. Control 
and LPS-ati undated gel images from individual donor experiments 
were matched to generate, composite images; composite images were 
then matched into a master composite image to track the LPS response 
of protein spots among different donors (2fi), Only those spots that were 
common (image-rnatched) to all original 12 (pfl 3.0-10.0) gels were 
considered for further analysis. For these spots, the LPS-induced 
change in integrated intensity in the six experiments was subjected to 
statistical analysis with a two-tailed Student's t test, and those spots 
with p < 0.05 were identified by peptide mass fingerprinting (described 
below). For the narrow range (pH 5.0-6.0, 5.5-6.7, and 6-11) two- 
dimensional PAGE experiments using pooled donors, only those spots 
with concordant regulation exceeding' 1. 5-fold or that appeared de nova 
in the LPS gel in two repeat experiments were further analyzed. 

Ill-gel Tryptic Digestion — Itvgel digestion of protein spots was per- 
formed with sequencing grade porcine-modified trypsin using the 
method of Hcllman et al. (27). Tryptic peptides were then extracted (50 
/il of 50% :icetonitri)e/5% trifluoroacetic acid, 2 h), and the supernatant 
was taken to dryness in a vacuum centrifuge and then red issnlved in 
trifluoroacetic acid (20 jtd, 0.5%). Peptides were then purified and con- 
centrated using ZipTip Cia pipette tips (Millipore, Bedford, MA). 

MALDI-TOF Mass Spectrometry — Analyses were performed on an 
Applied Biosystems matrix assisted laser desorption ionization time-of- 
flight (MALDI-TOF) Voyage r-DE PRO mass spectrometer (Framing* 
ham, MA) operated in delayed exti nction mode. Samples (0.5 /d) were 
spotted onto u sample plate to which matrix (0.5 ;d of 10 mg/ml CHCA) 
was added. The sample-matrix mixture was dried at room temperature 
and then analyzed in reflector mode. CHCA was also spotted alone as a 
negative control. Spectra were the sum of 100 laser shots, and those 
peaks with a signal-to-ucnse ratio of greater than 3: I we. re selected for 
data base searching. Spectra were internally calibrated using auto lytic 
trypsin peptides (mh. 842.51, 221 1.10), 

Data Base. Searching Algorithm — The monoiso topic masses for each 
protonated peptide were: {«) entered into the program Mft-I'^t (available 
at prospector. uesf.edu) for searches against the, Swiss-Prot, NCBL 
and GenPept databases, and (f>) entered into Mascot, (available at 
matrixacience.com), an algorithm testing statistical significance of pep- 
tide mass fingerprinting' identifications. For MS-Fit searches, masses 
derived from trypsin, CHCA, keratin, and Coomassie Brilliant Blue 
O-250 were excluded. Search parameters included a maximum allowed 
peptide mass error of 0.1 I.) a (0 8 Da in the few instances in which linear 
mode was used), consideration of one incomplete cleavage per peptide, 
pi range of 3.0-10.0, and molecular mass range of 1-?,00 kDa. Accepted 
modifications included carbarn idomethylation of cysteine residues 
(from iodoaoetamide exposure following 1EF) (28) and methionine oxi- 
dation, a common modification occurring during SDS-PAGE (29). Pro- 
tein ideutificatioiLS were assigned when three criteria were met: I) 
statistical significance ip < 0.05) of the match when tester) by Mascot 
(ma trixscience.com); 2) >20% yequenee coverage by the tryptic pep- 
tides; and 3) concordance f z.l5%> with the molecular weight and pi of 
the parent two-dimensional PAGE protein spot. The following' special 
exceptions were considered: (a) protein identifications not fulfilling 
criterion 2 were still assigned if criteria 1 and 3 were fulfilled and no 
other Homo sapiens proteins with peptide mass-matched p values < 
0.05 were identified by Mascot; (ft) if criterion 3 was not fulfilled (lower 
than expected molecular weight), a cleavage product of the identified 
protein was inferred, and the cumulative molecular weight of the tryptic 
peptides was compared with that of the two-dimensional -PAGE a put to 
ensure that it was not exceeded; in) if criterion 3 was not fulfilled (Isolated 
discordance between then re ti en! and observed pi), post-translntional mod- 
ification of an in i recovered peptide was inferred; and (d) if two or more //. 
sapiens protein assignments with >4 mutually exclusive matching pep- 
tides were identified, a protein mixture in the two-dimensional PAGE 
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spot was inferred and further analysis halted (quantitative conclusions 
regarding the individual protein constituents could not be drawn). 

RESULTS 

Genes Differentially Expressed in. LPS- stimulated Neutro- 
phils — Human PMNs were left untreated or incubated in the 
presence of 100 ng/ml LPS for 4 h. As a control to confirm that 
the PMNs were quiescent at baseline and that LPS resulted in 
normal stimulation, rnRNA was isolated, cDNA was prepared, 
and PCR for TNF-ty was performed. Little TNF-a expression 
was seen in nonstimulated cells, whereas LPS treatment led to 
an increase in expression in each of the donor3 subsequently 
used for microarray analysis (data not shown). No macrophage- 
colony stimulating factor receptor transcript was detected by 
oligonucleotide microarray analysis, confirming there was no 
significant monocytic contamination. 

Human PMNs express a limited repertoire of rnRNA tran- 
scripts at baseline but respond to IiPS with d itfe run ti a 1 expres- 
sion of genes in many families. Considering only those genes 
present by microarray analysis in all three donors, unstimu- 
lated PMNs expressed 1.3.0% (923 of 7070 genes) uf the Af- 
fymetrix gene set. Gene classes represented at baseline include 
metabolic enzymes, structural proteins, receptors, signaling 
proteins, and transcription factors. By comparison, human 
monocytes expressed —40% and human fibroblasts - 35% of the 
represented genes (data not shown). By the criterion of a >3- 
fold increase in expression in all three donors on Asymetrix 
oligonucleotide array analysis, exposure of PMNs to LPS for 4 h 
resulted in the up- regulation of 100 genes (Table I). 

Genes from several different functional classes were induced 
in PMNs following LPS exposure. Of interest, a number of 
transcriptional regulators were induced, including transcrip- 
tion factors of the NF-kB family. The transcriptional NF-*B 
complex has previously been implicated in the regulation of the 
genes induced by LPS (11). The genes for several cytokines and 
chemokines were also found to be up-regulated. These include 
TNF-tt t /L-//3, MCP-1, MIP-3a, und MlP-lfi (Table I). 

PCR was performed to confirm the results from the microarray 
analysis. PCR analysis on selected genes indicates that the 
time course for changes can be rapid or delayed but parallel the 
changes found in the array at the 4-h time point (data not 
shown). Other up-regulated genes included those for metabolic 
enzymes, immune response molecules, kinases, phosphatases, 
signaling molecules , adhesion and cytoskeletal components, 
interferon -stimulated genes, and those with unknown or mis- 
cellaneous function (Table I). 

LPS stimulation of PMN also resulted in the down-regula- 
tion of 5G genes (Table II). Down-regulated genes were identi- 
fied as transcriptional regulators, protein and lipid kinases and 
phosphatases, structural molecules, and signaling molecules. 
Genes for metabolic proteins were also evident, as were several 
uncharacterized genes. 

Two-dimensional PAGE and Image Analysis— \i\ contrast to 
the limited number of transcripts found at baseline, PMNs 
were found to express a large number and variety of proteins in 
the nonstimulated state (Fig. J , A and C, and Tables I1J-V). 
Reproducible protein expression patterns were found on the pH 
3.0-10.0 gels, and the majority of proteins fell in the pH 5.0- 
7.0 range (Fig. 1A). The basic region (pH > 7.0) consistently 
exhibited poor resolution, precluding meaningful image analy- 
sis and further workup (data not shown). Depending on the 
spot-finding parameters (minimum spot intensity, filter width) 
selected on the image analysis software, spot-by-spot manual 
editing was found to be necessary to avoid over- and underde- 
tected spots; moreover, further manual editing was performed 
to screen for unmatched and mismatched spots following 
matching of paired control and LPS-stimulatcd gels. After spot 



editing, —1200 well- resolved spots were evident on each pH 
3.0-10,0 gel. In an attempt to improve resolution of the pi 
range bearing the greatest number of well -resolved spots, over- 
lapping narrow pH range gels (pH 5.0-6.0, 5.5-6.7, 6-11) were 
also run. Of interest, a similar number of well-resolved spots 
(—1200) were detected on the narrow pH range gels (Fig. 1, C 
and D). Assuming a detection limit for Coomassie of 15 ng (0.20 
pmol, or 1.5 X 10 11 molecules, for a 60-kDa protein) and a 
protein load per gel corresponding to 75 x 10 G PMNs, wo 
estimate a detection limit on our gels of 2000 molecules/cell for 
a 60-kDa protein. As investigators have suggested in other cell 
lines with the use of high resolution two-di mens ion al-PAGE 
methods (30), we estimate that > 1.0,000 proteins are expressed 
in the resting PMN. 

Human PMNs respond to LPS with the differential expres- 
sion of a large number of proteins. In the six individual pH 
3.0-10.0 experiments, the number of protein spots that in- 
creased in integrated intensity by at least 50% following LPS 
exposure was 185, 122, 104, 104, 96, and 131, respectively. The 
number of protein spots that decreased by at least 50% follow- 
ing LPS exposure was 72, 151, 102, 98, 128, and 97, respec- 
tively. Although gel-to-gel regional variability in resolution was 
expected to account for individual spots not being well visual- 
ized on particular gels, only those spots that were matched iv 
all 12 original gels were analyzed further. Overall, the number 
of spots matched to all 12 original gels was 125. The numbers 
of spots that were both matched to all 12 original gels and that 
increased hy at least 50% in integrated intensity in the indi- 
vidual experiments following LPS exposure were 46, 13, 17, 27, 
22, and 20, respectively. The numbers of spots that were 
matched, to all 12 gels and that decreased by at least 50% were 
6, 22, 17, 22, 34, and 28, respectively. The LPS-induced change 
in integrated intensity of the 125 spots that were matched to ail 
12 original gels was subjected to statistical analysis with a 
two-tailed Student's I test, and those spots with statistically 
significant (p < 0.05) regulation among the six experiments; 
were identified by peptide mass fingerprinting (Table III). 

Identification of LPS -regulated Proteins- -Several proteins 
were consistently up- regulated on the pH 3.0-10.0 gels (Table 
III), including regulators of inflammation (annex in 1X1.) and 
signaling molecules (Rab-GDP dissociation inhibitor fl). Sev- 
eral aetin fragment's were seen to be consistently up-regulated 
in the six experiments following LPS exposure (Table 111). Of 
interest, the proteasome jB chain was also consistently up 
regulated. Down -regulated proteins included other signaling 
molecules, such as Rho GTPase activating protein 1. 

On the pH 5.0-6.0 and 5.5-6.7 gels, several proteins were 
found to show increases of greater than 1.5-fold following LPS 
exposure (Tables IV and V), including cytoskeletal proteins, 
such as moesin, non muscle myosin heavy chain, and a putative 
phusphorylated form of non muscle myosin heavy chain, and 
signaling molecules, such as protein phosphatase 1 and P0 4 - 
stat.hmin. The putative phosphorylatcd form of nonmuscle my- 
osin heavy chain {spot til 101) was positioned 0.03 pil unit more 
acidic than the unmodified protein (spot #1102) (Pig. W) and 
was distinguished by a tryptic peptide (m/z 1366.74) not pres- 
ent in the unmodified protein, consistent with phosphorylation 
of serine 685. Serine 685 is predicted by NetPhos 2.0 Prediction 
Server (available at www.cbs.dtu.dVservices/NetPhos/(31)) to 
he a high probability phosphorylation residue and by Scan- 
Prosite (www.expasy.ch/tools/scnpsite.htrnl) to be a substrate 
for protein kinase C. The tryptic phosphopeptide identified in 
P0 4 -stathmin, extending from residues 15 to 27 (1468.7 Da), is 
consistent with phosphorylation of either serine 16, a known 
substrate for Ca^* '/calmodulin (OaM)-d open dent kinases (32), 
or serine 25, a known substrate for p38£ and EUK (Fig. 2A) 
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Tabu? I 

Human neutrophil genes induced after 4 h of LPS exposure. 



Description 



OfiiiDank"' no. 



Changt;-!bUi 



Transcriptional regulation 



Pleomorphic (ulen.oma gene-like 2 


D83784 


16.8 


NFKB2 


S7GG38 


12.;* 


NFKBIE 


U91616 


11.5 


p65 


L 19067 


8.4 


ECU 


U05681 " 


7.7 


K box binding protein I 


M31627 


7.5 


Metal-regulatory transcription factor 1 


X78710 


7.4 


Ets-2 


J04102 


7.4 


c-Rel 


X75042 


6.2 


NFKBl 


M58603 


5.8 


Basic laucine zipper transcription factor, A'W-like 


U15460 


4.7 


1KB 


M69043 


3.8 


MAX dimerization protein 


L06895 


3.6 


DIF2 


S819I4 


3.1 



Cytokines find receptors 
'MCP-1 
MIP-ip 

aHelix coiled-coil rod homoloj; 
ILlfi 

GF03 (beta) 
TNF-a 

MlPSa 

t Lion A 

IL-6 

GKOa 

HM74 

I m nm ne response 
Oronomucoid 

Complement component C3 

Protease inhibitor 9 

Complement component 3a receptor I 

Protease inhibitor 3 

SLP//antileukoprotease 

i?Z,AA7/2/elastase inhibitor 

CD58 

Complement cumponent PFC 

Kinases 
CNK I FHK / PL K-lik e 
Cot 
Pim-2 
LIMK2 

Phosphatases 
PAC-UDUSP2 
DUSP5 
PHA1 

Signaling molecules 
TNFAIPlih2Q 
THAFl 
RanBP2 
GNAI5 
PTAFH 

Adhesion anil c.ytoskeleton 
I CAM 1 

CEACAMl (bilary alyeciprotem) 
LfMSl 

SNLhicXiu bundling protein 

GalccUn-l/LGAIJSl 

MEMDiALCAM 

CD44 

TSQ-6 

Metabolic 

OTP cyclohydrolasr. I 
N/)LW2/ubiquinone reductase 
PSAMC/tproteusorne iota) 
UDP~galactose transporter (SLC35A2) 
PLAU (urokinase) 
KYNU/Lrkynuroninfi hydrolase 
AMPD3 

P4HA i /p r oly 1 4 - hyd roxy i a s a 
■v Glutamylcysteine synthetase 
ATP6D 
ATP6S1 



M 69203 

M72885 

AF014S58 

X04500 

M5773 1 

X02910 

U64197 

U0OG72 

Y00081 

X64489 

1)10923 



X02544 
K02765 
U7 136"4 
U28488 
LI 0343 
X04470 
M 9 3 05 6 
YOU 636 
M83652 



U5C998 
D14497 
U77735 

D45906 



LI 1329 
U 15932 
U73477 



M59465 
U 1926 1 
1)42063 
M63904 
D 10202 



M 2 4283 
X 16354 
U09284 
U03057 
M57710 
U30999 

H02S8I--HT3125 
M31105 



78.7 
48.8 
20.8 
17. (i 
17.3 
14.5 
8.1 
7.3 
6.3 
4 

3.8 



20.2 
12.8 
9.5 
6.1 
4.9 
4.7 
4.6 
3.8 
3.5" 



16.2 
11.9 
9.5 
4.3 



1 1.8 

5.3 
3.4 



10 

6.2 
5.6 
5.2 
3.9 



22.4 
6.3 
6.1 
5.9 
4.7 
4.2 
3.9 
3.7 



U 19523 


13.5 


M22538 


8.6 


X59417 


8.4 


D84454 


7.3 


X02419 


6.4 


U5772I 


5.5 


012775 


5 


M244*6 


4.7 


L35546 


4.5 


J05682 


4.2 


D 16469 


4 
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r FABLE 1 — continued 



Description Gt:nl1mtk n) no. Change- fold 



Glycerol kinase. X68285 3.6 

FACLl L09229 3.5 

A/C'J X«0(J73 3.3 

Interferon-inducible 

ISG15 Ml 3755 22.5 

Mxl M33882 19.4 

IFldG M24594 12.1 

fNDO M34455 5.2 

GBPI M55642 4.3 

U50648 3.7 

7F/T4 1152513 3.6 

//?/54 MM6G0 3.5 

IFI58 U34G05 3.5 

IFP35 U72882 3 

Other 

Oos2 M72885 48.8 

MlHCUAPl U37546 7 2 

K/AAO/05 D 1466 1 5.1 

KIAA0U8 1)42087 5 

SNAP 23 1155936 5 

CASP5 ' U28015 4.8. 

KIAA0U3 D30755 4.8 

KIAAQ2SS D87444 4.7 

Hepatoma-deriued GP D1643I 4.7 

ITGS2 1)28235 4.6 

CLPtfJ? M3776H 4.3 

UNCI 19 homolog - U40998 4.2 

KIAAOI5I D63485 3.9 

XM035U60 3.8 

Ann win W J04543 3.7 

KIAA01W 014811 3.7 

Adrvnomedultin. D 14874 ' 3.7 

A/Af/ U83115 3.6 

K7AA026O PH743Y 3.2 

/>5-l LOB 175 3.2 

Scavenger receptor ex pressed by t;ndolhfilin! cells 1)63483 3.2 

VHL ' U 511)9 3 I 



(33). Assuming that no other multiply phosphorybited stath- 
min species had escaped detection, analysis of the integrated 
intensities of the P0 4 -stathinin and stathmin spots indicates 
that the percentage of the P0 4 form of total cellular stathmin 
increased from 11% to 38% with LPS stimulation (Fig. 2£). 
This is similar to a previous report of an increase from < 10% to 
35-40% of the Ser^-phosphorylated form in Jurkat cells stim- 
ulated with anti-CD3 (34). 

Effect of SB203580 on LPS -stimulated Gene Expression— 
Gene expression analysis of PMNs stimulated with LPS indi- 
cated that the majority of genes induced by LPS were unaf- 
fected by prior treatment of PMN with SB203580. Of the 100 
genes up-regulated by LPS, the up-regulation of 23 was inhib- 
ited by greater than 40% (Table VI). The majority of these 
genes affected by SB203580 were inhibited by less than (>()%, 
whereas only six were inhibited by greater than 80%, all of 
which represent previously identified interferon-sti inula ted 
genes. Induction of cytokine genes by LPS, with the exception 
of ILG } was generally unaffected by SB203f>80. 

Effect ofSB203580 on LPS -stimulated Protein Expression— 
Similar to the effect of SB203580 on LPS- stimulated gene 
expression, little effect, of SB203580 was seen on expression 
levels for the majority of LPS-regulated proteins (Table VU). 
Two exceptions are annexin ITT and iv-enola.se, for which LPS- 
stimulated expression was attenuated in the presence of the 
p38 MAPK inhibitor. 

Comparison of Microarray and Protcornics Results — Of the 
LPS-regulated proteins identified by peptide mass fingerprint- 
ing for which probes were present on the oligonucleotide mi- 
croarray, poor concordance was found at the inRNA level (Table 
VIII). For 13 LPS-up-regulatcd proteins, 2 corresponding 



niRNA transcripts were up- regulated, 1 was down -regulated ,.5 
were unchanged, and 5 were not detected by the A fry matrix 
chip. For 5 down-regulated proteins, 3 corresponding tran- 
scripts were down -regulated, 1 was unchanged, and 1 was not 
detected. Varying patterns of LPS regulation emerge for those 
candidates detected at both the transcript and protein level. 
Proteasome jB chain was up-regulated at both the transcript 
and protein levels (Table VTJT), with no notable effect of 
SB 203580 on expression at either level. Similarly, CAP1, Rho- 
GAP1, and ficolin 1 were down-regulated at both the rnRNA 
transcript and protein level ('fable VTII), with no notable effect 
ofSB203F>80. Annexin 111 was down-regulated at the transcript 
level and up- regulated at the protein level, with an inhibitory 
effect of SB203580 seen only at the protein level (Tables VII 
and VIII). 

DISCUSSION 

Interaction of bacterial LPS with the human PMN repre- 
sents a model system for studying the activation and output of 
the innate immune system during infection and inflammation. 
A recent publication (35) describes the gene expression changes 
of a cultured monocytic cell line after infection by the Gram - 
positive bacterium Listeria monocytogenes. The cell wall com- 
ponents of Cii'H m-positive bacteria, like Gram-negative-derivod 
LPS (i.e. from E. coli), are known to signal through TLfts (3b\ 
37). Importantly, many of the expression changes found in 
LPS-stirnulated PMNs in the present study were also described 
in the bacteria-exposed monocytic cells, indicating that many of 
the gene expression changes seen in bacterial infection are 
likely mediated by TLRs (38, 39) and that the LPS model 
system accurately reflects exposure of immune cells to in fee- 
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Tabu? II 



Hutnct.Fi. neutrophil genes repressed (^4 -fold) ctfu 


ir 4 h of /j/'iS fixposxre. 




Description 


OenBank™ rut. 








'fold 


Kinases 






CAMK, 11, gam ma 


U50360 


-4 


Oiacylglycerol kinase, delta 


D6347B 


-4.2 


PRKCL2/VnK2 protein kinase C-like 2 


U33062 


-4.3 


MAPKAPK3 


U09578 


-6.3 


Protein kinase Ht31. cAMP-iltspeude»t 


nG2167-HT2237 


-8 


CAMK if 


1,07044 


-9.8 


Transporters . 






SLC25A5 /so\uie carrier family 20, m amber 5 


J02683 


-4.2 . 


SLCWAl; folate transporter 


U 17566 


4.4 


SLC2A3; facilitated glucose transporter 


M 20 681 


-5 


Metabolic 






Carbonic anhydrase /V 


1.10955 


-4.4 


RNaae A family, h6 


U64998 


-4.5 


Glycogen p hos p ho ry laser, liver 


M 14636 


-4.6 


Inos itolp clyphosp h ate - 5 p hospha tase 


U67650 


4.6 


Inositol 1,3,4 -trisphosp ha 1 e 6/G -kirias a 


U51336 


-4.7 


Transkctolasc 


L127I 1 


-4.8 


Protein phosphatase 4, rug. subunit 1 (clone 23840) 


U79267 


- 4.9 


Cytidine deaminase 


L27943 


-5.4 


MGAT1 


M5S62I 


-5.4 


HMOXl 




--■5.4 


MAN2A2 


L28821 


-5.8 


Glyr.ogenin (also represents 113 1525) 


HG4334-HT4604 


-5.9 


Structural 






Kibrinogen-like protein (pT49 protein) 


Z3B531 


■ 4.2 


I12AFZ 


M37583 


-4.7 


Paxillin 


U 14588 


-4.9 


uamin is rf 


Tj2593 1 


— *>.y 


Dytiarnin 2 


UG983 


•6.2 


Actinia I 


M 95 1 78 


-6.7 


a-Tubuliti 


X01703 


- 10 


Tubulin, txl, isufurrn 44 


HC!2259-HT2348 


- 15 


Transcriptional regulators 






Lymphoblastic leukemia-derived sequence J 


M22638 


-4.4 


MAX interacting protein I 


L07648 


4.5 


Nuclear factor crythroid 2 isofnrm f 


S77763 


-6 


Transducer of ERISB2. 1 


0381105 


■6.9 


NFATC4 


L41067 


-7.8 


ATP.9 tCTiF-Tlni*) 


L055 J 5 


-9.6 


Receptors 






Lyrnphatoxin 0 receptor 


1.04270 


4.4 




U0847 1 


- 5 


Ul 1875 


-5.3 


Sitfnalintf 






Pix-u; cool -2 (KIAAOfldfi) 


1)25304 


-4.5 


ARHB/lihoB 


M12174 


-4.5 


TNFSFIO; TRAIL 


U3751R 


•6.6 


Ca 2) binding 






ANXIl 


L 1960S 


■4.3 


SW0A4 


M 80563 


-4.8 


AN XI 


X05908 


-4.8 


Other 






Proteolipid protein 2 


1,09604 


-4.9 


Protein phosphatase /, a catalytic Bubunit 


HG1614-HTJ614 


-5 


TIMP2 


MS 2304 


-5,1 


KIAA0199 


D837B2 


-5.2 


Lipin. 2 (KIAA0249) 


U87436 


-5.6 


LRMP(Jqw^) 


U 10485 


-5.8 


CVGBP2 


U69546 


•6.9 


Clone 23933 


U79273 




PECAM1 


L34GS7 


-8 


Delta deep - r ndu ting pep t i de 


7,50781 


-8.7 


DiQenrge synd. critical region, gene 2 (K1AA0163) 


079985 


•9 


SELPLG; CD 162; seloctm P iigand 


U 25956 


• 32 



tion. Nevertheless, the reliance upon DNA micruarruys alone cation of both DNA micron cray and prateomics technology to 
affords insight only into the transcriptional response without our model system provides unique insight into both the cellular 
corroboration at the protein level. In the present study, appli- biology of the activated FMN and the responsiveness and reg- 
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Fio. 1. Two-dimensional PAGE of LPS-exposed human PMNs. 
A and B, colloidal Coomassie Blue-stained pH 3.0-10.0, two-dimen- 
sional PAGE gels (A, control; fl, LPS-exposed) with up-regulated {solid 
arrows) and down -regelated (hatched an-ows) proteina indicated. These 
results are representative of six separate experiments. C and D, colloi- 
dal Coomassie Blue-stained pH 5.0-G.0, two-dimensional PAGE geLa 
(C, control; D t LPS-exposed) with up-regulated (solid arrvws), new 
(solid. afTow, open arrowhead), and down- regulated (hatched arrows) 
proteins indicated. LPS-exposed PMNs from three blood donors were 
pooled. 

illation of its transcriptional and trnnsiational machinery. As 
will be discussed below, our study identifies, in particular, 
novel aspects of the LPS -stimulated PMN transcriptional reg- 
ulation, activity in the innate immune response, signaling, 
cytoskeletal reorganization, and priming for granule release. 

In the present study, the increase in NF->cR transcript abun- 
dance (Table I) detected by the microarrays corroborates the 
findings of other studies of PMNs and monocytes (40) and 
indicates a mechanism for the responsiveness and scope of the 
PMN transcriptional machinery following LPS exposure, NF- 
kB, recently described to be activated by LPS through the 
TLR/MyD88/interloukin-l receptor-associated kinase pathway 
(1, 4), is the only transcriptional complex reported to be in- 
duced by LPS in the PMN, However, because the transcrip- 
tional NF-kB complex has been implicated in the regulation of 
only a portion of the genes induced by LPS in this study (data 
not shown), the importance of alternative transcriptional reg- 
ulators in the PMN is clear. Of interest, several other known 
and putative transcriptional regulators with less well defined 
functions were also up- regulated in the present study, includ- 
ing PLAGL2, a putative zinc-finger protein, XHPl, MVFt, 
Ets 2, B ATF, and DIF-2. On the other hand, LPS-down-regu- 
lated genes include ATF-2 (a known target of p38), NFATC4, 
TOB I, NF-B2, MXl-1, and LYL-P Although the exact role of 
these gene products in regulating cell function is unknown, 



these data indicate that the range of transcriptional responses 
in the LPS-stimulated PMN is much broader than previously 
suggested and that the signaling capabilities of the PMN in the 
immune response are thereby likely extended in scope and 
specificity. 

As expected from the literat ure, the 'genes for several cyto- 
kines and cheinckines, including IL-lfi, fL-6, and M1P-1&, were 
found to be up-regulated (Table I). On the other hand, the 
notable absence of up- regulated cytokines in the proteomics 
experiments rellects their removal in the post- LPS incubation 
wash performed prior to lysis for two-dimensional- PAGE. Up- 
regulation of these inflammatory mediators is well documented 
in PMNs exposed to LPS and in animal models of LPS- induced 
sepsis syndrome and acute respiratory distress syndrome, a 
PMN-mediated illness (41., 42). Several genes in this family 
were up -regulated that hove not, to our knowledge, been de- 
scribed in LPS-stimulated cells, including MCP-1, GR03, 
IL-lORAi and HM74, an orphan G protein-coupled receptor 
with homology to chemokine receptors. The down- regulation of 
TNFSF1Q, lyrnphotoxin b receptor, and TNFAIPl were also 
observed. The modulation of genes involved in cytokine signal- 
ing, including the adapter molecules TRAFJ (LPS and TNF 
receptor signaling) and TNFAIPl (TNF receptor signaling) and 
several kinases and phosphatases, may indicate a change in 
cytokine responsiveness after LPS treatment. Relevant in this 
regard from the proteolytics data arc: 1) the up-regulation of 
protein phosphatase i, which has been shown to regulate PMN' 
NADPH oxidase activation and translocation (43, 44) and to 
regulate LPS-induced NK-kB activation (45); 2) the down -reg- 
ulation of Hho-GAPl, which has been shown to regulate 
NADPH oxidase activity in the PMN (46); and 3) the up- 
regulation of P0 4 -stath.rnin (Table TV), a phosphoprotein pos- 
tulated to function as a relayer and integrator of multiple 
signal transduction pathways (34). Several noncytokine, 
nonchemokine genes involved in the immune response were 
also up-regulated, including the complement pathway mem- 
bers C3, C3AR1, and PFC; the protease inhibitors EUiNH2 
(elastase inhibitor), SLPl, PI -3, and PJ-9\ and the acute phase 
protein oroso mucoid. LPS regulation ofC3ARl and orosoinu- 
void, expression have not previously been reported. In the pro- 
teornics experiments, the down-regulation of ficolin-1 (Table III), 
a collectin-iike celt surface protein reported to activate the com- 
plement system and to mediate adhesion and phagocytosis in 
monocytes hut not previously reported in granulocytes (47), may 
represent negative modulation of the innate immune response. 
Hie finding that genes other than cytokines and chemokines are 
regulated by the PMN in response to LPS indicates that the PMN 
plays a more sophisticated role in host-defense and immunity 
than previously thought. 

Treatment of the PMN with LPS lead to the induction of a set 
of genes associated with the anti-viral Type 1 interferons, 
IFNoY/3. 'lias induction occurs independently of the release of 
IFN or another unidentified soluble factor. 2 Furthermore, the 
set of genes expressed is smaller than that induced by I.FNo/0, 
as described by Beret ai (12). This may he due to differences in 
the scope of the signaling systems activated by LPS and 
IFNry//-;, or the time course of analysis of genes in the LPS- 
stimulated PMN. The implication that LPS treatment of PMN 
allows PMN to express anti-viral activity is currently being 
tested. Of interest was the finding that induction of interferon - 
stimulated genes was blocked by pre treatment of PMNs with 
SB203580. Work from our laboratory has indicated that signal 
transducers and activators of transcription activation does not 
occur in response to LPS in PMNs. 2 In addition, interferon- 



K. C. Malcolm find G. S. Worthon, manuscript in preparation. 
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Tahlk ITT 

Analysis of phi 3.0-10.0 two- dim cnsionul PAGE gels 
Mean change(-fold) in expression level among six PMN donorfi is reported. The change in expression for the proteins listed was statistical))' 
significant (p < 0.05) as measured by a two-tailed Student's f test. 

Identification (spot no.J 



Up -regulated 
Proteasomo £ chain [646] 
Annexin Til (5001 
Actin fragment- [544]° 
Actin Augment [6911" 
u-Eiiolase (380] 

Rab-GDP dissociation inhibitor fs (289) 
Glutathione S-trnnsfcraae P (6481 
Prc-B-ccIl colony enhancing factor (1 152} 

Doton-regulated 
Adcnyiyl cyclase -associated protein 1 (2GBJ 
Rho-GAPl [283] 
Pieolin 1 15 PI 



Swiss- Pro t no. 


EKtiniHtml 

MM 


'I'litmruticrtl 


PflJJtitjtJS UlHtchtid/ 

submitted 


Protei n 
covered 


change 








% 


% 


•fold 


P28070 


27/5.7 


29.2/5.72 


9/12 (75%) 


36% 


1.51 


PI 24 29 


31/5.7 


36.4/5.6 


14/18(78%) 


42% 


1.37 


P02570 


32/5-5 


(41.7/5.29) 


13/15 (87%) 


(34%) 


1.74 


P02570 


30/5.4 


(41.7/5.29) 


14/18 (78%) 


(29%) 


1.60 


P06733 


41/5.7 


47.2/7.01 


9/10 (90%1 


24%» 


!.G5 


P50395 


50/6. 1 


60. 7/6.. 1 1 


10/1 1 (91%) 


25% 


1.24 


P0921J 


23/5.5 


23.4/5.43 


6/8 (75%) 


41% 


1.54 


P43490 


53/7.0 


fi5.5ye.n9 


12/lfi (75%-) 


25% 


1.29 



Q01G18 
Q07960 
000602 



55/7.3 
50/5.8 
33/6.5 



51.7/8,07 
50.4/5.85 
35/6.39 



16/22 (73%) 
7/9 (7H%) 
10/12 (83%) 



34% 
22% 
25% 



0,53 
0.67 
0 74 



° The theoretical pi and M n of native actin are indicated Protein coverage indicates coverage of native aci.i 



Table TV 

Analysis of pH 5.0-6.0 two-dimensional PAGE gels 
Results are lrorn pooled samples for control (n"3) and ] PS-exposed (a :: 3) PMNs from human donors. Expression of the reported proteins was 
altered > 1.5-fold following LPS exposure in two repeat experiments. "New'" designates proteins seen in the LPS gel in two repeat experiments but 
not detectable in the corresponding control gels. 

Identification lepot nod 



Up-regulated 
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14% 
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30% 


O.o 



° TrEMBL accession number. 

h Accession number and theoretical pi and M H for the unmodified protein are indicated. 
v rVCBl accession n\unl>er. 
rf Sec text for explanation. 

c Among three experiments, the ratio of P0 4 -sUithmin expression increase, following LPS exposure in the presence, of SB20358U divided by that 
in the absence of SB203580, was 0.93. 
^Genpept accession number. 

* This search was performed using average masses measured by linear mode fa A LOT-TOP MS 

Tauuc V 

Analysis of pli 5.5-6,7 two-dimanswnal PAGE gals 
Results are from pooled samples for control (a - 3) and LPS-nxposed (n ™ 3) PMNs from human donors. Expression of tine reported proteins was 
altered ■:> 1.5-fold following LPS exposure in two repeat experiments. 
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Q 9943 9 
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10/11 (90%) 


27% 


0.5 



regulatory factor 3, a known regulator of in terferon-sti inula ted 
gene transcription, is not a direct target of p38 kinase. 2 There- 
fore, gene expression analysis of LPS-stimulated PMNs has 
uncovered a previously uncharacterized signal transduction 
system that is sensitive to inhibition of p38 MA.PK. 

Knowledge of the genes down-regulated by LPS permits the 



development, of further hypotheses addressing PMN function in 
the face of infection. Strikingly, several down -regulated genes 
and gene products are structural in nature (e.g. pax ill in, ueti- 
nin, calpunin H2) (Tables II and V). A known consequence to 
the PMN of LPS ex'p° s,jre is decreased motility (48). Up- regu- 
lation of genes for adhesion molecules (/CAM- 7, CD44, AL- 
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F]G. 2. A, the predicted sequence of the tryptic phosphopeptide in 
F0 4 -KtaLhmin (1468.72 Da>. The peptide mass measured by MALDl- 
TOF MS and the predicted mass differed by 14 ppm. As indicated, two 
alternate phosphorylation sites are possible: serine 16 and serine 25. D, 
P0 4 -sUthmm and stathmin were identified on the control and Un- 
exposed pH 5.0-6.0 gels. Consistent with phosphorylation, the PO<- 
stathmin spot was distinguished by a peptide of mass 1408.72 D» {Ik. 
80 Da greater than the peptide af 1388.72 Da senn in the stathmin spot). 
Assuming that no other multiply phosphorylated stathmin species have 
escaped detection, analysis of the integrated inton-sities of the V0 4 - 
stathmin and stathmin spots indicates that the percentage of the P0 4 
form of total cellular stathmin has increased from 1 \% to 3 8% with LPS 
stimulation. The decrease in integrated intensity for stathmin was 
equal in amount to the increase in I J 0 4 -sUithmin following LPS 
exposure. 

CAM, and TSG-6), and down -regulation of genes for structural 
proteins, indicates u genetic basis for this observation. Down- 
regulation of two genes implicated in cytoskeletal regulation, 
Pix'fr and RhoB, was also observed. The calcium -bin ding pro- 
tein S100A4, down-regulated in LPS-treated PMN a (Table U), 
has been implicated in cell motility and metastasis (49). De- 
creased motility may be beneficial in sustaining the inflamma- 
tory response at sites of infection. In addition, LPS treatment 
results in an inhibition of apoptosis (50'). Therefore, the longer 
residence time of the PMN at sites of infection is consistent 
with the long term genetically coded changes seen in these 
gene- profiling experiments and indicates that the changes in 
gene expression are functionally relevant to host defense and 
immunity. 

By providing information on post-translational modification, 
the protcomics data may provide further insights into the cy- 
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Effect of SB20358Q on LPS- stimulated gene, expression 
(Joints are reported for which the SB203580/control expression ratio 
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toskeletal remodeling effects of LPS upon the PMN. We con- 



tend that the actin fragments identified (Table III) are unlikely 
to represent technical artifacts. Rather, their specificity (iden- 
tical molecular weight/pi among different, experiments), statis- 
tically significant up- regulation by LPS, as well as the ut>e of a 
lysis buffer containing chaotropes and multiple protease inhib- 
itors argue instead that these fragments are physiologic con- 
sequences of LPS exposure in the human PMN. More specifi- 
cally, the up- regulation of these fragments following LPS 
exposure (Table Til) suggests that LPS may activate an actin- 
cleaving enzyme, which, in turn, remodels the cytoskeleton. 
Intriguing in this vein, calpnin has recently been reported to 
play an important role in cell migration and cytoskeletal orga- 
nization of fibroblasts (51). The possibilities that LPS may 
induce calpaiu activation and that calpain activation may reg- 
ulate cytoskeletal reorganization and motility are currently 
under investigation. An alternative possibility is that actin 
cleavage is a marker of neutrophil apoptosis (D2). 

Other LPS-regulatcd proteins may play important roles in 
cytoskeletal reorganization. The up -regulation of protein- ty- 
rosine kinase 9-1 ike (A6-related protein) may modulate LPS- 




31.300 LPS -activated Neutrophils: 
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LPS-regulatcd protein a for which a probe was present on the 
Affy matrix chip 

A comparison of corresponding protein and mRNA transcript changes 
following LPS exposure is shown. 
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induced actin polymerization, because it bears a high degree of 
homology to fcwinfilin (AG), an actin inonomer-binding protein 
that local i/.es to sites of rapid filament assembly in cells and is 
believed to regulate nctin filament turnover (53). In turn, LPS- 
induccd down-regulation of Rho-GTPase activating protein 1 
(Table III) may regulate twin film (and protein -tyrosine kinase 
9-like) activity, because twinfilin has been shown to colocalize 
with Racl and Cdc42 and to be regulated by active Rai l in NTH 
3T3 cells (b'3). Activation of Rho proteins may be facilitated by 
LPS up-regulation of i noes in (Table V), because moesin report- 
edly induces the dissociation of Rho from GDI (54). Rod may, 
in turn, promote activation of* the actin filament-nucleating 
Aip 2/3 complex through interactions with WASP (Wiskott-Al- 
drich syndrome protein) family proteins (55) and, interestingly, 
is postulated to regulate the dynamics of both the actin and 
microtubule cytoskele tons via phosphorylation of stathmin (Ta- 
ble IV) (56). Calponin H2 is an aetin-binding protein not pre- 
viously reported in PMNs that is postulated to play n role in 
cytoskeletal organization (57). Its down -regulation by LPS (Ta- 
ble V) likely modulates LPS-in.du.ced cytoskeletal reorganiza- 
tion. The up- regulation of nonrnuscle myosin heavy chain and a 
putative phosphorylatcd form of myosin heavy chain (putative 
protein kinase C substrate by prediction rules) in the LPS- 
exposed PMN (Table TV) is of uncertain significance; myosin 
has been implicated in multiple functions in the PMN, includ- 
ing locomotion, fluid pinocytosis, and phagocytosis (58). Of 
interest, however, S100A4 (down-regulated, Table II) has been 
reported to regulate cyto skeletal dynamics by inhibiting pro- 
tein kinase C-mediated phosphorylation of nonrnuscle myosin 
heavy chain (59). 

LPS induction of stathmin phosphorylation (Table TV and 
Fig. 2) may represent another mechanism by which the cy- 
toskeleton is remodeled. Stathmin is a phosphoprotein report- 
edly involved in both signal transduction and in regulation of 
the microtubulin filament network; furthermore, phosphoryla- 
tion of stathmin has been reported to modulate its tubulin - 
binding avidity (60). Inferences can be made about both the 
phosphorylation site on P0 4 -stathmm and the responsible ki- 
nase induced by LPS. Four phosphorylation sites in stathmin 
have been well described: Ser 10 , Ser* Ser 38 , and Ser 63 (32, 33). 
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Ser 10 has been reported as a substrate for Ca 2 * 1 /calmodulin 
(CaM)-dependent kinases (32), and Ser 2b as primarily a sub- 
strate for p38 and ERK (33), with p34 cil( ^ also active but bear- 
ing a 6- fold preference for Ser 3S (34). As stated above, the. 
phosphopeptide identified in P0 4 -stathmin, extending from 
residues 15 to 27 (1468.7 Da), is consistent with phosphoryla- 
tion of either Ser 10 or Ser 5 *"' (Fig. 2). Although both p38o and 
p38tt MAPK isoforms are expressed in the human PMN, LPS 
has been shown to selectively activate the p38a- isoforrn in 
human PMNs (9). The p38c* isoforrn, however, has been shown 
to be relatively inactive at Ser ft,s ; in fact. p38? is — 100-fold more 
active at Ser 25 , and selective; p38a inhibitors do not inhibit the 
stress-activated phosphorylation of stathmin in 293 cells (33). 
Further support for the lack of involvement of p38 signaling in 
phosphorylation of stathmin in our system is the apparent lack 
of effect of SB203580 (a selective p38n- and p380 inhibitor) on 
LPS-induced expression of PO^-stathmin (Table IV). Because 
p34«te2 i s relatively inactive at Ser 2,,i (34), we conclude that the 
phosphorylation site is likely to be Ser ut , a reported substrate 
of CaM -dependent kinase. Although CaM kinases have previ- 
ously been implicated in gene activation in LPS-exposed my- 
elomonocytic HD11 cells (61), stathmin signaling has not, to 
our knowledge, been previously reported in either PMNs or 
lipopolysaccharide signal transduction. 

Cytoskeletal reorganization, n well-described regulator of 
granule release (62), may underlie LPS-induced priming for 
PMN granule release, but several LPS- regulated proteins may 
provide 'more specific clues. LPS exposure led to increased 
levels of grancalcin, a calcium-binding protein previously de- 
tected in PMNs and shown to trans! ucute to granules and. 
plasma membrane in the presence of physiologic concentra- 
tions of calcium (63). Similarly, annexin III, a calcium-binding 
protein highly expressed in PMN granule membranes and im- 
plicated in calcium- mediated secretion (64) and in granuJo fu- 
sion (65), was also found to be up-regulated. Exocytosis of 
granule contents may also be facilitated by LPS up- regulation 
of Rab-GDP dissociation inhibitor (Table III), which has been 
proposed "to recycle Rab after vesicle fusion by extracting it 
from the membrane and loading it onto" newly formed transport 
intermediates (6b*). 

Parallel use of DNA microarrays and proteomics affords a 
powerful strategy for comparison of corresponding mliNA tran- 
scripts and proteins, thereby affording new insight into the 
mechanisms by which the cell regulates its signaling responses 
to the external environment. Of interest, a poor correlation was 
found between corresponding transcripts and proteins (Table 
VTTT), as reported in other systems (17, 18). The finding in some 
'cases of unchanged transcript abundance in the face of regu- 
lated protein levels indicates post- transcriptional modulation 
following LPS exposure. The finding of undetected transcripts 
in the face of regulated levels of the corresponding proteins 
may indicate previous transcription of these genes in an earlier 
state of the myeloid maturation of the PMN, producing stable 
protein species that have undergone post-translational alter- 
ation following LPS exposure. The. use of SB20358O, a p38 
inhibitor, adds further insights into the mechanisms of LPS 
regulation. At the level of mRNA expression, SB20358U inhib- 
ited 23% of LPS-stimulated genes by >40% and 1 \ % of genes 
by £60 c &; therefore, p38 plays a specific role in gene regulation 
in the PMN. In particular, proteasome )3 chain whs up-regu- 
lated at both the mRNA transcript and protein level (Table 
VTII), with no notable effect of SB 203580 on expression at 
either level, consistent with a non-p38-mediated pathway of 
primary transcriptional up-regulation induced by LPS. Simi- 
larly, CAP1., Rho-GA.Pl, and ficolin I were dowiirregulated at 
both the mRNA transcript nnd protein level (Table VIII), with 
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no notable cfiect of SB203580, consistent with a non-p3S- me- 
diated pathway of primary transcriptional down -regulation. 
Interestingly, annexin HI was down -regulated at the transcript 
love) and up- regulated at the protein level, with an inhibitory 
effect of SB2035B0 seen only at the protein level (Table Vll), 
consistent with a p38-rnediated post-tranacriptional up-regu lo- 
tion induced by LPS. 

Limitations of the present study should be noted. Gone ex- 
pression analysis by cDNA microarrays does not distinguish 
between transcriptional regulation and mRNA stabilization; 
similarly, two-dimensional PAGE proteomics by itself does not 
distinguish among transcriptional, tra relational, or post-trans- 
lational regulation of protein abundance. Transcript detection 
by micro-array technology is limited to the probes included; 
protein identification by two-dimensional PAGE proteomics is 
limited to well-resolved regions of the gel, may perform less 
well with hydrophobic and high molecular weight proteins, and 
tends to select for more abundant protein species (30). Harvest- 
ing of the LPS-incubated PMNs at 4 h may have prevented 
detection of earlier, transient changes and may have thereby 
introduced a rti factual transcript- protein discordance. Further- 
more, the post-LPS incubation, pre- two- dimensional PAGE cell 
washes would be expected to remove secreted proteins from 
further analysis, with uncertain effects on detected protein 
abundance depending on such factors us the degree of de novo 
synthesis and extent of degranulation/exocytosis. Because pro- 
tein binding of Coomassie Blue has a limited dynamic range 
and is typically not linear throughout the range of detection, 
image analysis of Coomassie Blue -stained protein spots should 
be considered semi-quantitative. For some protein spots, the 
apparent magnitude of regulation by LPS may have been 
blunted by the spot approaching staining saturation in the 
control gel. By limiting our analysis to those protein spots 
common to all twelve pH 3.0-10.0 two-dimensional gels, we 
likely excluded some LPS -regulated proteins that happened to 
be either poorly resolved on a subset of the gels or unmatched 
by the image analysis software. By further limiting the analy- 
sis to those matched spots on the pH 3,0-10.0 gels for which a 
two-tailed / test demonstrated p < 0.05, the list of regulated 
proteins was likely also limited by statistical power. In addition 
to those regulated proteins listed in Table III, three others were 
up-regulated and three down-regulated with p < 0.09 (data not 
shown). 

Limiting our reported results to those changes that met 
statistical significance among the donors carries further impor- 
tant implications. We have encountered a two order of magni- 
tude range of response in unsclected donor LPS-indueed PMN 
functions, such as TNF-« and superoxide anion release (data 
not shown). The sources of this physiologic heterogeneity re- 
main uncertain but may possibly include such factors as nat- 
ural mutations of the LPS receptor component, TLR4 (67). By 
selecting for LPS effects common to all donors, we may not have 
characterized the range of genomic and proteomic heterogene- 
ity present in the population and thereby may have focused on 
only a narrow portion of a broader biological response to LPS. 
We contend that this reductionist approach is valid because it 
would be expected to enrich for biologically integral responses 
of the PMN to LPS. Nevertheless, correlation of genomic and 
proteomic profiles with functional phenotypes of the PMN may 
bear important diagnostic and therapeutic implications and 
will be pursued in future studies. 

Widespread regulation of numerous noncytokine/chemokine 
genes and proteins in the LPS-stimulated human PMN is a 
novel finding. These data indicate that, despite a narrow scope 
of gene expression in the nonstimuluted state, the terminally 
differentiated, short-lived PMN likely plays a role in the innate 



immune response that is far more sophisticated and dynamic 
than the simple release of preformed in tin minatory mediators. 
Although gene expression appears to be an important mecha- 
nism by which PMNs respond acutely to infection, mRNA tran- 
a crip (/protein concordance is limited, and post- transcriptional 
(and post-translatioaal) modifications also play an important 
role. The alteration of multiple transcriptional regulators, G- 
protcin regulators, PO. r stathmin, and protein phosphatase 1 
indicates that one of the responses to LPS exposure is to modify 
subsequent signaling events by bacterial components or by 
other cytokines arid chemokines. Finally, the finding that p38 
MAPK mediates LPS regulation of a limited subset of trail 
scripts and proteins underlines the continuing need to define 
signal transduction cascades in the neutrophil. 
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